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A SYSTEM FOR CKI J^RASKn SfTRFFNT^r: 



Cross Reference 

This appUcation claims priority to U.S. Provisional Applications for Patent 
Serial Nos. 60/122,152 (February 26, 1999), 60/123399 (March 8, 1999), 09/352,141, 
(July 12, 1999), 60/151.797 (August 31, 1999), 60/168,408 (December 1, 1999); and is 
a continuation in part of 09/430,656 (October 29, 1999); 09/398,965 filed September 
17. 1999 which is a continuation in part of Serial No. 09/031,271 filed February 27, 
1998 which is a continuation in part of U.S. Application S/N 08/810983, filed on 
February 27, 1997. 

Field of The Inventinn 

This invention is in the field of fluorescence-based ceU and molecular 
biochemical assays for drag discovety. 

Background of the Invention 

Drug discovery, as currently practiced in the art, is a long, multiple step process 
involving identification of specific disease targets, development of an assay based on a 
specific target, validation of the assay, optimization and automation of the assay to 
produce a screen, high throughput screening of compound Ubraries using the assay to 
identily "hits", hit validation and hit con^wund optimization. The output of this 
process is a lead compound that goes into pre-clinical and, if validated, eventuaUy into 
cUnical trials. In this process, the screening phase is distinct from the assay 
development phases, and involves testing compound efficacy in Uving biological 
systems. 

Historically, drug discovery is a slow and costly process, spanning numerous 
years and consuming hundreds of millions of doUars per drag created. Developments 
in the areas of genomics and higji throughput screening have insulted in mcreased 
capacity and efficiency in the areas of target identification and volume of compounds 
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Hisloridly. drug discovtry is a slow and «BUy process, spaiming nnmeioiK 
years and consuming hundreds of millions of doUare per dnrg created. Dcvdopmenls 
in the areas of genomics and high throughput screening have resulted in increased 
cap^dty and efficiency in the areas of target identification and volume of compounds 
5 screened. Significant advances in automated DMA sequencing, PCR plication, 
positional cloning. hybHdization an^ys, and Woinibrmatics tmc greatly increased thi 
number of genes (and gene fiagments) raicoding potential drug screening targets. 
However, the basic sohme for drug screening remians the same. 

Validation of genomic targets as points for therapeutic interventiou using the 
» exrstmg methods and protocols has become a bottleneck in the drug discovery process 
due to ttre slow, manual methods employed, such as v«, tecfional models 
fcnatonal analysis. f recombinant proteins, and stable cell line expression of candidate' 
genes. Primary DNA sequence data acquired through autom«ed sequencing does not 
p™. id^fflcatio. o, gene taction, but can provide infotmadon about common 
Wnts and specific gene homology when compared to Imown sequence databases 
0^0 methods such as subtraction hybridi^ion ^ RADE (r^d ampUficatio. of 
expression, can be used to ge^s d«. are up or down regulated in a 

d-sest^emodel. However, identificafionand validation sfiU proved downfl^same 
P^way some proteomio medrods use protein id^tificafi™ (global expression anays 
a> ctetroptarosis. combinatorial hbrarics) in combin«io. with reverse genetics to 

■d-fied by the bundrods ™thou. providing any information regarding type actiCir 

Z -^-^ ""'^ » Of DAS as dTg 

,s ^^om". and tirus extremely inefflcient, without fimctional dati. u. ZZl 
m^hamstic Uukwith disease. 1. is necessary, therefore, to provide new technoUl 

n,.d,yscreenDAS,oestahlishhiolo^calfimctiou.therebyimprovingtiugetv.HZ^^ 
and candidate optimization in drug discovery. g-vahdation 

n^ro are three maior avenues for irr^viug early drug discovery productivity 
F^-^ere .s a need for tools tita. provide increased information handl Tahiti!' 
Btot^ormatics has blossomed with the rapid development of DNA sequenclT^' 
--^evolutionoftiregenomicsdatabase. Genontics is hegimting". p ^ 
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role in the identification of potential new targets. Proteomics has become indispensible 
in relating structure and function of protein targets in order to predict drug interactions. 
However, the next level of biological complexity is the cell. Tharefoie, fliere is a need 
to acquire, manage and search multi-dimensional information from cells. Secondly, 
there is a need for higher throughput tools. Automation is a key to improving 
productivity as has aheady been demonstrated in DNA sequencing and high throughput 
primary screening. The instant invention provides for automated systems that extract 
multiple parameter information from ceUs that meet the need for higher throughput 
tools. The instant invention also provides for miniaturizing the methods, thereby 
allowing increased throughput, while decreasing the volumes of reagents and test 
compounds required in each assay. 

Radioactivity has been the dominant read-out in early drug discovery assays. 
However, the need for more information, higher throughput and miniaturization has 
caused a shift towards using fluorescence detection. Fluorescence-based reagents can 
yield more powerfiil, multiple parameter assays that are higher in throughput and 
information content and require lower volumes of reagents and test compounds. 
Fluorescence is also safer and less expensive than radioactivity-based methods. 

Screening of cells treated with dyes and fluorescent reagents is well known in 
the art. There is a considerable body of literature related to genetic engineering of cells 
to produce fluorescent proteins, such as modified green fluorescent protein (GFP), as a 
reporter molecule. Some properties of wild-type GFP are disclosed by Morise It al 
(Biochemistry 13 (1974). p. 2656-2662). and Ward et al. (Photochem. Photobiol 31 
(1980), p. 611-615). The GFP of the jellyfish Aeguorea victoria has an excitation 
maxmium at 395 mn and an emission maximum at 510 mn. and does not require an 
exogenous factor for fluorescence activity. Uses for GFP disclosed in the literature are 
widespread and include the study of gene expression and protein localization (Chalfie 
et al.. Science 263 (1994). p. 12501-12504)). as a tool for visualizing subcellular 
organelles (Rizzuto et al.. Curr. Biology 5 (1995), p. 635-642)). visualization of protein 
transport along the secretory pathway (Kaether and Genles, FEBS Letters 369 (1995) 
p. 267-271)), expression in plant cells (Hu and Cheng. FEBS Letters 369 (1995) p' 
331-334)) and Drosophila embryos (Davis et al.. Dev. Biology 170 (1995) p 726- 
729)), and as a reporter molecule fixsed to another protein of interest (U. S Patent 
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5.491,084). Similarly. W096/23898 relates to methods of detecting biologically active 
substances affecting intraceUular processes by utilizing a GFP constract having a 
protein kinase activation site. This patent, and all other patents referenced in this 
application are incorporated by reference in their oitirety 

5 Numerous references are related to GFP proteins in biological systems. For 

example. WO 96/09598 describes a system for isolating cells of interest utilizing the 
expression of a GFP like protein. WO 96/27675 describes the expression of GFP in 
plants. WO 95/21191 describes modified GFP protein expressed in transfonned 
organisms to detect mutagenesis. U. S. Patents 5.401.629 and 5.436.128 describe 
' assays and compositions for detecting and evaluating the intracellular transduction of 
an extracellular signal using recombinant cells that express cell surfece receptors and 
contain reporter gene constructs that include transcriptional regulatory elements that are 
responsrve to the activity of cell surface receptors. 

Performing a screen on many thousands of compounds requires parallel 
handlmg and processing of many compounds and assay component reagents. Standard 
high throughput screens ("HTS") use mixtures of compounds and biological reagents 
along vath some indicator compound loaded into arrays of wells in standard microtiter 
plates with 96 or 384 wells. The signal measured fiom each weU. either fluorescence 
enussron. optical density, or radioactivity, integrates the signal fiom all the material in 
the well giving an overall population average of all the molecules in the well 

Science Applications International Corporation (SAIC) 130 Fifth Avenue 
Seattle. WA. 98109) describes an imaging plate reader. Uris system uses a CCD 
camera to image the whole area of a 96 well plate. TTre image is analyzed to calculate 
the total fluorescence per well for all the material in the well. 

Molecular Devices, Inc. (Sumryvale. CA) describes a system (FLIPR) which 

us. low angle laser scanning illumination andamask to selectively excite fluo^^^^ 
withm approximately 200 microns of the bottoms of the wells in standard 96 well 
Plates in order to reduce background when imaging cell monolayers. TMs system uses 
CCD camera to image the whole area of the plate bottom. Although this system 

measur^ signals originating fiomacell monolayer at the bottom ofthe well, the signal 
measured is averaged over the area of the well and is therefore still considerTa 

measurementoftheaverageresponseofapopulationofcells.TT.e image is analyzed to 
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calculate the total fluorescoice per weU for ceU-based assays. Fluid deUvety devices 
have also been incorporated into cell based screening systems, such as the FLIPR 
system, in order to initiate a response, which is then observed as a whole well 
population average response using a macro-imaging system. 
5 In contrast to high throughput screens, various high-content screens ("HCS") 

have been developed to address the need for more detailed information about the 
temporal-spatial dynamics of cell constituents and processes, ffigh-content screens 
automate the extraction of multicolor fluorescence infonnation derived fiom specific 
fluorescence-based reagents incorporated into ceUs (Giuliano and Taylor (1995), Curr. 
10 Op. Cell Biol Giuliano et al. (1995) Ann. Rev. Biophys. Biomol. Struct. 24:405)^ 
Cells are analyzed using an optical system that can measure spatial, as well as temporal 
dynamics. (Faricas et al. (1993) Ann. Rev. Physiol 55:785; Giuliano et al. (1990) In 
Optical Microscopy for Biology. B. Herman and K. Jacobson (eds.), pp 543-557 
Wiley-Liss, New York; Hahn et al (1992) Nature 359:736; Waggoner et al (1996) 
.5 Huni.Pathol.2T.m). The concept is to treat each cell as a 'Veil" that has spatial and 
temporal information on the activities of the labeled constituents. 

THe types of biochemical and molecular infomiation now accessible through 
fluorescence-based reagents applied to cells include ion concentrations, .embr^e 
potential, specific translocations, enzyme activities, gene expression, as well as the 
Pi^ence. amounts and patterns of metabolites, proteins, lipids, carbohydr^es. and 
nuc e ae,d sequences (DeBiasio et al.. (1996)^./. Cell 7:1259;Giuliano et al 
imsUnn. Rev. Biopkys. BiomoL Struct. 24:405; Heim and Tsien. (1996) Curr. Biol. 

can beperformed on either fixed cells, using fluore^^^^^ 
. la^^eled antibodies, biological ligands. and/or nucleic acid hybridization probes, or livl 
cells usmg multicolor fluorescent indicators and Wnsors." H^e choice of fixed or 
hve cell screens depends on the specific cell-based assay required 

Fixed cell assays are the simplest, since an array of initially fiving cells in a 
-cromer plate fonnat can be treated with various compounds and doses being tested, 
then the cells can be fixed, labeled with specific reagents, and measured 2 

zrrT °' ' ^""^ ^^^^^ is 

acquired, but only at one time point. 7.e availability of thousands of antibodies 
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ligands and nucleic acid hybridization probes that can be qjplied to cells makes this an 
attractive approach for many types of cell-based screens. The fixation and labeling 
steps can be automated, allowing efficient processing of assays. 

Live cell assays are more sophisticated and powerful, since an airay of Uving 

5 cells containing the desired reagents can be screened over time, as well as space. 
Environmental control of the cells (temperature, humidity, and carbon dioxide) is 
required during measurement, since the physiological health of the ceUs must be 
maintained for multiple fluorescence measurements over time. There is a growing list 
of fluorescent physiological indicators and "biosensors" that can report changes in 

) biochemical and molecular activities within cells (Giuliano et al., (1995) Ann. Rev. 
Biophys. Biomol. Struct. 24:405; Hahn et al.. (1993) In Fluorescent and Luminescent 
Probes for Biological Activity. W.T. Mason, (ed.). pp. 349-359. Academic Press. San 
Diego). 

The availability and use of fluorescence-based reagents has helped to advance 
the development of both fixed and live cell high-content screens. Advances in 
instrumentation to automatically extract multicolor, high-content information has 
recently made it possible to develop HCS into an automated tool. An article by Taylor 
et al. iAmerican Scientist 80 (1992), p. 322-335) describes many of these methods and 
thar plications. For example. Proffitt et. al. (Cytometry 24: 204-213 (1996)) desa^be 
a semi-automated fluoroscence digital imaging system for quantifying relative cell 
number, m situ in a variety of tissue culture plate fom^ats. especiaUy 96-weU microtiter 
plates. The system consists of an epifluorescence inverted microscope with a 
motonzed stage, video camera, image intensifier, and a microcomputer with a PC 
Vrs.on digitizer. Turbo Pascal software controls the stage and scans the plate taking 
multipe^agesperwell. software calculates total fluorescence per well, provides 
for darly calibration, and configure easily for a variety of tissue culture plate formats 
Thr^holdmg of drgital images and reagents which fluoresce only when taken up by 
l-ng cells are used to reduce background fluorescence without removing excess 
fluorescent reagent 

221.3.1.319) and muWphoton micn,«.,pc taagfag (Deric a al., (1990) Scle,^ 
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248:73; Gratton et al., (1994) Proc. of the Microscopical Society of America, pp. 154- 
155) are also well established methods for acquiring high resolution images of 
microscopic samples. The principle advantage of these optical systems is the very 
shallow depth of focus, which allows features of limited axial extent to be resolved 
5 against the background. For example, it is possible to resolve internal cytoplasmic 
features of adherent cells fiom the features on the cell surface. Because scanning 
muWphoton imaging requires very short duration pulsed laser systems to achieve the 
high photon flux required, fluorescence lifetimes can also be measured in these systems 
(Lakowicz et al., (1992) Anal. Biochem. 202:316-330; Genittsen et al. (1997) J. of 
a Fluorescence 7:1 1-15)). providing additional capability for different detection modes 
Small, reliable and relatively inexpensive laser systems, such as laser diode pumped 
lasers, are now available to allow multiphoton confocal microscopy to be applied in a 
fairly routine fashion. 

A combmation of to biolosi<al hetemgenaly of clb in populattons (Bright, et 
al.. (1989). J. Cett. PkyM. UUAW. Giuli^. (,996) Cel, MM. Qtofa/ 35 237)) as 
well as the high spatial atxl temporal frequency of chemical and molecnta infcmtation 
P-ent wifltin cells, majces i, hnpossible to extnu. highK»nten. i^,i„n fr„„ 
pop„l«i„.s of cells using existing whole mic^tite, plate readen,. No existing high- 
conta,, screening platfonn has been designs! for mUticolor, tluotescence-hased 
^ using cells fta. ate analyzed in^vidually. Similarly, no method is cunenUy 
amiable that combines automated fluid delivety u> anays of cells for the putpose of 
^anahca^y sceening compounds for the abiBty ,„ induce a celluto tesponse that is 
.^Med by HCS analysis. e^«,a„y from cells ^ m mic,r.titer plates 
Funhennote, no method exists in the a„ combinmg high th^ughput well-by-„ell 

m««reme„,stoiden,ify.^its-inoneassayMow.dbyas.co.dhigheonten,ceU-by- 
oeU measurement on U» sante plate of only those wells identifled as hits. 

n.e instant invention provides sys.«ns, mefltods, and scteens that c^nbine high 
«-ughp„t sceening (HTS) and high content screening (HCS) that signifrcanUy 

frrrTr"°°°°'"^'^°''"""'^-'»''™«"™vce.isc.ee.^ 

fc™. wth nuorescence-based molecular reagents and eomputa-based feahn. 
-««,o„ data analysis, and «atoma.on. tesulSng in inceased quantity and speed of 
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data collection, shortened cycle times, and, ultimately, faster evaluation of promising 
drug candidates. The instant invention also provides for ininiaturisdng the methods, 
thereby allowing increased througlqjut, while decreasing tiie volumes of reagents and 
test compounds required in each assay. 



SUMMARY OF THE INVENTION 

In one aspect, the present invention relates to a method for analyzing ceUs 
comprising providing cells containing fluorescent reporter molecules in an array of 
locations, treating the ceUs in the array of locations with one or more reagents, 
imaging numerous cells in each location with fluorescence optics, converting tiie 
optical information into digital data, utilizing the digital data to determine the 
distribution, enviromnent or activity of the fluorescently labeled reporter molecules in 
the cells and the distiibution of the cells, and mterpreting that information in temis of a 
positive, negative or null effect of the compound being tested on the biological 
function 

In this embodiment, the method rapidly determines the distribution, 
enviromnent, or activity of fluorescentiy labeled reporter molecules in cells for the 
purpose of screening large numbers of compounds for tiiose that specifically affect 
particular biological functions. Tho array of locations may be a microtiter plate or a 
microchip which is a microplate having cells in an array of locations, m a preferred 
embodiment, the method includes computerized means for acquiring, processmg 
displaymg and storing the data received. In a preferred embodiment, the method 
further comprises automated, fluid delivery to the arrays of cells. In another preferred 
embodiment, the information obtained from high throughput measurements on the 
same plate are used to selectively perform high content screening on only a subset of 
the cell locations on the plate. 

In anotiier aspect of the present invention, a cell screening system is provided 
that comprises: 

• a high magnification fluorescence optical system having a microscope 
objective. 
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• an XY Stage adapted for holding a plate containing an array of cells and 
having a means for moving the plate for proper abgnment and focusing on 
the cell arrays; 

• a digital camera; 

'< • a light source having optical means for directing excitation light to cell 

arrays and a means for directing fluorescent Ught emitted fiom the cells to 
the digital camera; and 

• a computer means for receiving and processmg digital data fiom the digital 
camera wherein the computer means includes a digital fiame grabber for 
receiving the images from the camera, a display for user interaction and 
display of assay results, digital storage media for data storage and archiving, 
and a means for control, acquisition, processing and display of results. 

In a preferred embodiment, the cell screening system further comprises a 
computer screen operatively associated with the computer for displaying data. In 
another prefeired embodiment, the computer means for receiving and processing digital 
data from the digital camera stores the data in a bioinfonnatics data base. In a further 
preferred embodiment, the cell screening system further comprises a reader that 
measures a signal from many or all the wells in parallel. In another preferred 
embodiment, the ceU screening system further comprises a mechanical-optical means 
for changing the magnification of the system, to allow changing modes between high 
throughput and high content screening, m another prefeired embodiment, the cell 
screening system further comprises a chamber and control system to maintain the 
temperature, CO. concentration and humidity surrounding the plate at levels to 
keep cells alive, m a fimher prefen^d embodiment, the cell scr^g system utili^s a 
confocal scanning illumination and detection system. 

In another aspect of the present invention, a machine readable storage medium 
compnsmg a program containing a set of instructions for causing a ceU scr^„i„g 
system to execute procedures for defining the distribution and activity of specific 
cellular constituents and processes is provided. In a prefe^ed embodiment, the cell 
screemng system comprises a high magnification fluorescence optical system with a 
stage adapted for holding cells and a means for moving the stage, a digital camera, a 

9 
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light source for receiving and processing the digital data fix)m the digital camera, and a 
computer means for receiving and processing the digital data fix)m the digital camera. 
Preferred embodiments of the machine readable storage medium comprise programs 
consisting of a set of instructions for causing a ceU screening system to execute the 
procedures set forth in Figures 9, 1 1. 12, 13. 14 or 15. Another preferred embodiment 
comprises a program consisting of a set of instructions for causing a cell screening 
system to execute procedures for detecting the distribution and activity of specific 
cellular constituents and processes. In most preferred embodiments, the cellular 
processes include, but are not limited to. nuclear translocation of a protein, cellular 
hypertrophy, apoptosis, and protease-induced translocation of a protein. 

In another preferred embodiment, a variety of automated cell screening methods 
are provided, including screens to identify compounds that affect transcription factor 
activity, protein kinase activity, cell morphology, microtubule structure, apoptosis, 
receptor internalization, and protease-induced translocation of a protein. 

In another aspect, the present invention provides recombinant nucleic acids 
aicoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one detectable 
polypqjtide signal; 

b. a second nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal- 



and 



c. a third nucleic acid sequence that encodes at least one reactant target 
sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encodes the at least one protease recognition site. 

The present invention also provides the recombinant expression vectors capable 
of expressmg the recombinant nucleic acids encoding protease biosensors, as well as 
genetically modified host cells that are (ransfected with the expression vectors. 

The invention fiuther provides recombinant protease biosensors, comprising 
a. a first domain comprising at least one detectable polypeptide signal- 

c a SllT" protease'reSStion "C^d 
c. a third domamcompnsmg at least one reactant target sequence; 
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wherein the first domain and the third domain are separated by the second 
domain. 

In a further aspect, the present invention involves assays and reagents for 
characterizing a sample for the presence of a toxin. The method comprises the use of 
detector, classifier, and identifier classes of toxiii biosensore to provide for various 
levels of toxin characterization. 
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30 



BRIEF DKSr RIPTTON OF T HE PR AWTVng 

Figure 1 shows a diagram of the components of the cell-based scanning system. 

0 Figure 2 shows a schematic of the microscope subassembly. 
Figure 3 shows the camera subassembly. 

Figure 4 illustrates cell scanning system process. 

Figure 5 illustrates a user interface showing major functions to guide the user. 
Figure 6 is a block diagram of the two platform arohitecture of the Dual Mode System 

1 for Cell Based Screening in which one platfonn uses a telescope lens to read all wells 
of a microtiter plate and a second platfonn that uses a higher magnification lens to read 

individual cells in a well. 

Fi,ur. 7 is a detail of an optical sy«™ for a single platfom archi«otu« of fte Dual 
Mode System for Cel, Based Sceedng .hat uses a tnoveable -.eleseope- lens to read all 
wells of a n-ierotiter plate and a moveable Higher nugnificadon lens to read individual 

cells in a well. 

ng-tre , is an illustration of tte Htnd de,iv«y systen, for ac^g Bnetie data on the 

<-eiJ Based Screening System. 

Rgnre 9 is a fl„„ chart of processing step for the cell-based scanning system 
ngnre 10 iUustiates the stnttegy of fl,e Nuclear T^loeation Assay 

« Bal'; "^"^ -PS in the Dual Mode Systetn for 

Based Screenn-g combining high thn,ughp„, and high content scemng of 

microtiter plates. ^ 

^.r.l2isaf,o„chandeflningtt,ep„eessings.=psh,.hcffigh™„.^,„^,^ 
the System for Cell Based Screening. 

System for Cell Based Screening. 
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Figure 14 is a flow chart defining the processing steps required for acquiring kinetic 
data in the High Content mode of the System for Cell Based Screening. 
Figure IS is a flow chart defining the processing steps perfonned within a well during 
the acquisition of kinetic data. 

Figure 16 is an example of data fix>m a known inhibitor of translocation. 
Figure 17 is an example of data fiom a known stimulator of translocation. 
Figure 18 iUustrates data presentation on a graphical display. 

Figure 19 is an iUustration of the data firom the High Throughput mode of the System 
for Cell Based Screenmg, an example of the data passed to the High Content mode, the 
data acquired in the high content mode, and the results of the analysis of that data. 
Figure 20 shows the measurement of a drag-induced cytoplasm to nuclear 

translocation. 

Figure 21 iUustrates a graphical user interface of the measurement shown in Figure 20. 
Figure 22 illustrates a graphical user interface, with data presentation, of the 

measurement shown in Fig. 20. 

Figure 23 is a graph representing the kinetic data obtained from the measurements 
depicted in Fig. 20. 

Figure 24 details a high-content screen of drag-induced apoptosis. 
Figure 25. Graphs depicting changes in morphology upon induction of apoptosis 
Staurosporine (A) and pachtaxel (B) induce classic nuclear fragmentation in L929 cells 
BHK cells exhibit concentration dependent changes in response to staurosporine (C) 
but a more classical response to pacUtaxel (D). MCF-7 ceUs exhibit either nucJ 
condensation (E) or fi^gmentation (F) in response to staurosporine and paclitaxel 
respectively. In all cases. ceUs were exposed to the compounds for 30 hour. 
Figure 26 iUustrates the dose response of cells to staurosporine in tenns of both nuclear 
size and nuclear perimeter convolution. 

Figure 27. Graphs depicting induction of apoptosis by staurosporine and paclitaxel 
leadmg to changes in peri-nuclear f-actin content (A. B) Both apoptotic stimulators 
mduce dose-dependent increases in f-actin content in L929 cells. (C) In BHK ceUs 
staurosporine induces a dose-dependent increase in f-actin. whereas paclitaxel (D)' 
produces results that are more variable. (E) MCF-7 ceUs exhibit either a decrease or 
mcrease dependmg on the concentration of staurosporine. (F) Paclitaxel induced 
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changes in f-actin content were highly variable and not significant Cells were exposed 
to the compounds for 30 hours. 

Figure 28. Graphs depicting mitochondrial changes in response to induction of 
apoptosis. L929 (A3) and BHK (C J)) cells responded to both stauiosporine (A,C) and 
paclitaxei (B J)) with increases in mitochondrial mass. MCF-7 cells exhibit either a 
decrease in membrane potential (E, staurosporine) or an mcrease in mitochondrial mass 
(F, paclitaxei) depending on the stimulus. Cells were exposed to the compounds for 30 
hours, 28G is a graph showing die sunultaneous measurement of staurosporine effects 
on mitochondrial mass and mitochondrial potential in BHK cells. 
Figure 29 shows the nucleic acid and amino acid sequence for various types of 
protesae biosensor domains. (A) Signal sequences, (B) Protease recognition sites. (C) 
Product/Reactant target sequences 

Figure 30 shows schematically shows some basic organization of domains in the 
protease biosensors of the invention. 

Figure 31 is a schematic diagram of a specific 3-domain protease biosensor. 
Figure 32 is a photograph showing the effect of stimulation of apoptosis by cis-platin 
on BHK cells transfected with an expression vector that expresses the caspase 
biosensor shown in Figure 32. 

Figure 33 is a schematic diagram of a specific 4-domain protease biosensor. 

Figure 34 is a schematic diagram of a specific 4-domain protease biosensor, containing 

a nucleolar localization signal. 

Figure 35 is a schematic diagram of a specific 5-domain protease biosensor. 
Figure 36 shows the differential r^onse in a dual labeling assay of the p38 MAPK 
and NF-kB pathways across three model toxms and two difiFerent cell types. 
Treatments mariced with an asterisk are difiFerent from controls at a 99% confidence 
level (p< 0.01). 

DETAILED DESCRIPTION OF THE INVENTION 

All cited patents, patent ^plications and other references are hereby 
incorporated by reference in their entirety. 

As used herein, the following terms have the specified meaning: 
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Markers ofcelMar domains. Luminescent probes that have high afiBnity for 
specific cellular constituents including specific organelles or molecules. These probes 
can either be small luminescent molecules or fluorescenUy tagged macromolecules 
used as "labeling reagents", "environmental indicators", or "biosrasois." 
5 Labeling reagents. LabeUng reagents include, but are not limited to, 

luminescently labeled macromolecules including fluorescent protein analogs and 
biosensors, luminescent macromolecular chimeras including those fonned with the 
green fluorescent protein and mutants thereof, luminescenUy labeled primary or 
secondary antibodies that react with cellular antigens mvolved in a physiological 
10 response, luminescent stains, dyes, and other small molecules. 

Markers ofcelMar translocations. LuminescenUy tagged macromolecules or 
organelles that move from one cell domain to another during some cellular process or 
physiological response. Translocation markcni can either simply report location 
relative to the maikers of cellular domains or they can also be "biosensois" that report 
1 5 some biochemical or molecular activity as weU. 

Biosensors. Macromolecules consisting of a biological fimctional domain and a 
luminescent probe or probes that report the environmental changes that occur either 
intemally or on their surface. A class of luminescently labeled macromolecules 
designed to sense and report these changes have been termed "fluorescent-protein 
20 biosensors". The protein component of the biosensor provides a highly evolved 
molecular recognition moiety. A fluorescent molecule attached to the protein 
component in the proximity of an active site transduces environmental changes into 
fluorescence signals that are detected using a system with an appropriate temporal and 
spatial resolution such as the cell scamiing system of the present invention. Because 
the modulation of native protein activity widiin the living cell is reversible, and because 
fluorescent-protein biosensors can be designed to sense reversible changes in protein 
activity, these biosensors are essentially reusable. 

Disease associated sequences ("DAS"). This term refers to nucleic acid 
sequences identified by standard techniques, such as primary DNA sequence data, 
genomic methods such as subtraction hybridization and RADE, and proteomic methods 
in combination with reverse genetics, as being of drug candidate compounds. The term 
does not mean that the sequence is only associated with a disease state. 
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High content screening (HCS) can be used to measure the effects of drugs on 
complex molecular events such as signal transduction pathways, as well as cell 
functions including, but not Umited to. apoptosis, cell division, cell adhesion, 
locomotion, exocytosis. and cell-cell communication. Multicolor fluorescence permits 
multiple targets and cell processes to be assayed in a single screen. Cross-correlation 
of cellular responses will yield a wealth of infonnation required for target validation 
and lead optimization. 

In one aspect of the present invention, a cell screening system is provided 
comprising a high magnification fluorescence optical system having a microscope 
objective, an XY stage adqjted for holding a plate with an array of locations for 
holding cells and having a means for moving the plate to align the locations with the 
microscope objective and a means for moving the plate in the direction to effect 
focusing; a digital camera; a light source having optical means for directing excitation 
light to cells in the array of locations and a means for directing fluorescent light emitted 
fi-om the cells to the digital camera; and a computer means for receiving and processing 
digital data fiom the digital camera wherein the computer means includes: a digital 
frame grabber for receiving the images from the camera, a display for user interaction 
and display of assay results, digital storage media for data storage and archiving, and 
means for control, acquisition, processing and display of results. 

Figure 1 is a schematic diagram of a preferred embodiment of the ceU scamiing 
system. An inverted fluorescence microscope is used 1 such as a Zeiss Axiovert 
inverted fluorescence microscope which uses standard objectives with magnification of 
1-lOOx to the camera, and a white Ught source (e.g. lOOW mercuiy-arc lamp or 75W 
xenon lamp) with power supply 2. There is an XY stage 3 to move the plate 4 in the 
XY direction over the microscope objective. A Z-axis focus drive 5 moves the 
objective in the Z direction for focusing. A joystick 6 provides for manual movement 
of the stage in the XYZ direction. A high resolution digital camera 2 acquires images 
&om each well or location on the plate. There is a camera power supply 8. an 
automation controller 2 and a central processing unit 10. n,e PC H provides a display 
12 and has associated software. The printer 13 provides for printing of a hard copy 
record. 
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Figure 2 is a schematic of one embodiment of the microscope assembly 1 of the 
invention, showing in more detail the XY stage i Z-axis focus drive 5, joystick 4 light 
source 2, and automation controller 9. Cables to the computer 15 and microscope 16. 
respectively, are provided. In addition. Figure 2 shows a 96 well microtiter plate J2 
which is moved on the XY stage 1 in the XY direction. Light from the Ught source 2 
passes through the PC controlled shutter 18 to a motorized filter wheel 19 with 
excitation filters 20. The light passes into filter cube 25 which has a dichroic miiror 26 
and an emission filter 22. Excitation light reflects off the dichroic mirror to the wells in 
the microtiter plate 17 and fluorescent Ught 28 passes through the dichroic miiror 2fi 
and the emission filter 22 and to the digital camera 1. 

Figure 3 shows a schematic drawing of a preferred camera assembly. The 
digital camera 7, which contains an automatic shutter for exposure control and a power 
supply 31. receives fluorescent Ught 28 ftom the microscope assembly. A digital cable 
30 transports digital signals to the computer. 

The standard optical configurations described above use microscope optics to 
direcUy produce an enlarged unage of the specimen on the camera sensor in order to 
c^ture a high resolution image of the specimea This optical system is commonly 
referred to as 'wide field' microscopy. Those skilled in the art of microscopy will 
recognize that a high resolution image of the specimen can be created by a variety of 
other optical systems, including, but not Umited to, standard scamiing confocal 
detection of a focused point or line of illumination scamied over the specimen (Go et al. 
1997, supra), and multi-photon scanning confocal microscopy (Denk et al., 1990, 
sivra), both of which can form images on a CCD detector or by synchronous' 
digitization of the analog output of a photomultiplier tube. 

In screening appUcations, it is often necessary to use a particular cell Une, or 
primary cell culture, to take advantage of particular feahires of those cells. TTiose 
skilled in the art of cell culture will recognize that some cell lines are contact inhibited, 
meaning that they will stop growing when they become surrounded by other ceUs.' 
while other cell lines will continue to grow under those conditions and the cells will 
literaUy pile up, forming many layers. An example of such a ceU line is the HEK 293 
(ATCC CRL-1573) Une. An optical system that can acquire images of single cell 
layers in multilayer preparations is required for use widi ceU Unes that tend to form 
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layers. The large depth of field of wide field microscopes produces an image that is a 
projection through the many layers of cells, making analysis of subcellular spatial 
distributions extremely difficult in layer-forming cells. Alternatively, the very shaUow 
depth of field that can be achieved on a confocal microscope, (about one micron), 
allows discrimimition of a single cell layer at high resolution, simplifying thi 
determination of the subcellular spatial distribution. Similarly, confocal imaging is 
preferable when detection modes such as fluorescence lifetime imaging are required. 

The output of a standard confocal imaging attachment for a microscope is a 
digital image that can be converted to the same format as the images produced by the 
other cell screening system embodiments described above, and can therefore be 
processed in exactly the same way as those images. The overall control, acquisition 
and analysis in this embodiment is essentiaUy the same. The optical configuration of 
the confocal microscope system, is essentially the same as that described above, except 
for the iUuminator and detectors, niumination and detection systems r^uhcd for 
confocal microscopy have been designed as accessories to be attached to standard 
microscope optical systems such as that of the present invention (Zeiss. Germany). 
These alternative optical systems therefore can be easily integrated into the system as 
described above. 

Figure 4 illustrates an alternative embodiment of the invention in which cell 
arrays are in microwells 4Q on a microplate 4L described ion co-pending U.S. 
Application S/N 08/865,341. incorporated by reference herein in its entirety. Typically 
the microplate is 20 mm by 30 mm as compar«l to a standard 96 well microtiter plate 
which is 86 mm by 129 mm. The higher density array of cells on a micmplate allows 
the microplate to be imaged at a low resolution of a few microns per pixel for high 
throughput and particular locations on the microplate to be imaged at a higher 
resolution of less than 0.5 microns per pixel. These two resolution modes help to 
improve the overall throughput of the system. - 

The microplate chamber 42 serves as a microfluidic delivery system for the 
addition of compomids to ceUs. The microplate 41 in the microplate chamber ^ is 
placed in an XY microplate r^er 43. Digital data is processed as described above. 
ITie small size of this microplate system increases throughput, minimizes reagent 
volume and allows control of the distribution and placement of cells for fast and precise 

17 



wo 00/50872 



PCT/USOO/04794 



cell-based analysis. Processed data can be displayed on a PC screen U and made part 
of a bioinfonnatics data base M- This data base not only pennits storage and retrieval 
of data obtained through the methods of this invention, but also pennits acquisition and 
storage of external data relating to cells. Figure 5 is a PC display which illustrates the 
opoation of the software. 

In an alternative embodiment, a high dirougtq)ut system (HTS) is directiy 
coupled with the HCS either on the same platform or on two separate platfonns 
connected electronicaUy (e.g. via a local area networic). This embodiment of the 
invention, referred to as a dual mode optical system, has the advantage of increasing the 
throughput of a HCS by coupling it with a HTS and thereby requiring sloww high 
resolution data acquisition and analysis only on the small subset of wells that show a 
response in the coupled HTS. 

High throughput 'whole plate' reader systems are well known in the art and are 
commonly used as a component of an HTS system used to screen large numbers of 
compounds (Beggs (1997), J. ofBiomolec. Screening Macaffiey et al., (1996) 

J. Biomolec. Screening 1:187-190). 

In one embodiment of dual mode cell based screening, a two platform 
architecture in which high throughput acquisition occurs on one platfonn and high 
content acquisition occurs on a second platform is provided (Figure 6). Processing 
occurs on each platform independenUy, with results passed over a network interface, or 
a single controller is used to process the data ftom both platforms. 

As illustrated in Figure 6, an exemplified two platform dual mode optical 
system consists of two light optical instruments, a high throughput platform 6Q and a 
high content platform 65, which read fluorescent signals emitted fiom cells cultured in 
microtiter plates or microwell arrays on a microplate, and communicate with each other 
via an electronic connection 64- The high throughput platform 6Q analyzes aU the wells 
in the whole plate either in paraUel or rapid serial fashion. Those skilled in the art of 
screening will recognize that there are a many such commercially available high 
throughput reader systems that could be integrated into a dual mode cell based 
screening system (Topcount (Packard Instruments, Meriden, CT); Spectramax, 
Lumiskan (Molecular Devices, Sumiyvale, CA); Fluoroscan (Labsystems, Beverly' 
MA)). The high content platform ^ as described above, scans fiom well to well and 
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acquires and analyzes high resolution image data collected from individual cells within 
a well. 

The HTS software, residing on the system's computer 62, controls the high 
throughput instrument, and results are displayed on the monitor 61. The HCS software, 
residing on it's computer system 67, controls the high content instrument hardware 65, 
optional devices (e.g. plate loader, environmental chamber, fluid dispenser), analyzes 
digital image data from the plate, displays results on the monitor 66 and manages data 
measured in an integrated database. The two systems can also share a single computer, 
in which case all data would be collected, processed and displayed on that computer, 
without the need for a local area network to transfer the data. Microtiter plates are 
transfared from the high throughput system to the high content system 63 eithear 
manually or by a robotic plate transfer device, as is well known in the art (Beggs 
(1997), supra; Mcaffrey (1996), supra). 

In a preferred embodiment, the dual mode optical system utilizes a single 
platfoim system (Figure 7). It consists of two separate optical modules, an HCS 
module 203 and an HTS module 2^ that can be independenUy or collectively moved 
so that only one at a time is used to collect data from the microtiter plate 201. The 
microtiter plate 201 is mounted in a motorized X.Y stage so it can be positioned for 
imaging in either HTS or HCS mode. After collecting and analyzing the HTS image 
data as described below, the HTS optical module 209 is moved out of tiie optical path 
and the HCS optical module 203 is moved into place. 

The optical module for HTS 209 consists of a projection lens 214, excitation 
wavelength filter 213 and dichroic mirror 210 which are used to iUuminate the whole 
bottom of tiie plate witii a specific wavelength band torn a conventional microscope 
lamp system (not Ulustrated). The fluorescence emission is collected through the 
dichroic mirror 210 and emission wavelength filter 211 by a lens 212 which forms an 
image on the camera 216 with sensor 215. 

The optical module for HCS 203 consists of a projection lens 2Q8, excitation 
wavelengtii filter 202 and dichroic minor 204 which are used to illuminate the back 
aperture of the microscope objective 202. and thereby the field of that objective, from a 
standard microscope illumination system (not shown). The fluorescence emission is 
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collected by the microscope objective 202, passes through the dichroic miiror 204 and 
emission wavelength filter 2Q1 and is focused by a tube lens 20fi which fonns an image 
on the same camera 216 with sensor 215 . 

In an alternative embodiment of the present invention, the cell screening system 
further comprises a fluid delivery device for use with the live cell embodiment of the 
method of cell screening (see below). Figure 8 exemplifies a fluid delivery device for 
use widi die system of the invention. It consists of a bank of 12 syringe pumps 701 
driven by a single motor drive. Each syringe 702 is sized according to the volume to be 
delivered to each well, typically between 1 and 100 »iL. Each syringe is attached via 
flexible tubing 703 to a similar bank of connectors which accept standard pipette tips 
705. The bank of pipette tips are attached to a drive system so they can be lowered and 
raised relative to the microtiter plate 706 to deliver fluid to each well. The plate is 
mounted on an X,Y stage, allowing movement relative to the optical system 702 for 
data collection purposes. This set-up allows one set of pipette tips, or even a single 
pipette tip, to deliver reagent to aU the wells on the plate. The bank of syringe pumps 
can be used to deliver fluid to 12 wells simultaneously, or to fewer wells by removing 
some of the tips. 

In another aspect, the present invention provides a method for analyzing cells 
comprising providing an anay of locations which contain multiple cells wherein the 
cells contain one or more fluorescent reporter molecules; scanning multiple cells in 
each of the locations containing cells to obtain fluorescent signals from the fluorescent 
reporter molecule in the cells; converting the fluorescent signals into digital data; and 
utilizing the digital data to detennine the distribution, environment or activity of the 
fluorescent reporter molecule within the cells. 

Cell Arrays 

Screening large numbers of compounds for activity with respect to a particular 
biological function requires preparing arrays of cells for parallel handling of cells and 
reagents. Standard 96 weU microtiter plates which are 86 mm by 129 mm. with 6mm 
diameter wells on a 9mm pitch, arc used for compatibility with current automated 
loading and robotic handling systems. The microplate is typically 20 mm by 30 mm. 
with cell locations that are 100-200 microns in dimension on a pitch of about 500 
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microns. Methods for making microplates are described in U.S. Patent implication 
Serial No. 08/865341, incorporated by reference herein in its entirety. Microplates 
may consist of coplanar layers of materials to which cells adhere, patterned with 
materials to which cells will not adhere, or etched 3-dimensional sur&ces of similarly 
pattered materials. For the purpose of the following discussion, the temis 'well' and 
'microweU' refer to a location in an array of any construction to which ceUs adhere and 
within which the cells are imaged. Microplates may also include fluid delivery 
channels in the spaces between the wells. The smaller format of a microplate increases 
the overall efficiency of the system by minimizing the quantities of the reagents, 
storage and handling during preparation and the overall movement required for the 
scanning operation. In addition, the whole area of the microplate can be imaged more 
efficiently, allowing a second mode of operation for the microplate reader as described 
later in this document. 
Fluorescence Reporter Molecules 

A major component of the new drug discovery paradigm is a continually 
growing family of fluorescent and luminescent reagents that are used to measure the 
temporal and spatial distribution, content, and activity of intracellular ions, metabolites, 
macromolecules, and organelles. Classes of these reagents include labeling reagents 
that measure the distribution and amount of molecules in living and fixed cells, 
environmental indicators to report signal transduction events in time and space, and 
fluorescent protein biosensors to measure target molecular activities within living ceUs. 
A multiparameter approach that combines several reagents in a single cell is a powerful 
new tool for drug discoveiy. 

The method of the present invention is based on the high affinity of fluorescrat 
or luminescent molecules for specific ceUular components. The affinity for specific 
components is governed by physical forces such as ionic interactions, covalent bonding 
(which includes chimeric fusion with protein-based chromophores. fluorophores, and 
lumiphores). as well as hydrophobic interactions, electrical potential, and, in some 
cases, simple entrapment within a cellular component ITie luminescent probes can be 
small molecules, labeled macromolecules, or genetically engineered proteins, 
including, but not limited to green fluorescent protein chimeras. 
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Those skiUed in this art wiU recognize a wide variety of fluorescent reporter 
molecules that can be used in the present invention, including, but not limited to, 
fluorescently labeled biomolecules such as proteins, phospholipids and DNA 
hybridizing probes. Similarly, fluorescent reagents specificaUy synthesized with 
particular chemical properties of binding or association have been used as fluorescent 
reporter molecules (Barak et al.. (1997), /. Biol. Chem. 272:27497-27500; Southwick et 
al., (1990). Cytometry 11:418-430; Tsien (1989) in Methods in Cell Biology, Vol. 29 
Taylor and Wang (eds.). pp. 127-156). Fluorescentiy labeled antibodies are particularly 
useful reporter molecules due to their high degree of specificity for attaching to a single 
molecular target in a mixture of molecules as complex as a cell or tissue. 

The luminescent probes can be synthesized within the living cell or can be 
transported into the cell via several non-mechanical modes including diffusion, 
facilitated or active transport, signal-sequence-mediated transport, and endocytotic or 
pinocytotic uptake. Mechanical bulk loading methods, which arc well known in die art, 
can also be used to load luminescent probes into living cells (Barber et al. (1996), 
Neuroscience Letters IdlM-lQ- Bright et al. (1996), Cytometry 24:226-233; McNeil 
(1989) in Methods in Cell Biology. Vol. 29, Taylor and Wang (eds.). pp. 153-173). 
These methods include electroporation and other mechanical methods such as scrape- 
loading, bead-loading, impact-loading, syringe-loading, hypertonic and hypotonic 
loading. Additionally, cells can be genetically engineered to express reporter 
molecules, such as GFP, coupled to a protein of interest as previously described 
(Chalfie and Prasher U.S. Patent No. 5.491,084; Cubitt et al. (1995), Trends in 
Biochemical Science 20:448-455). 

Once in the cell, the luminescent probes accumulate at their target domam as a 
result of specific and high affinity interactions with the target domain or other modes of 
molecular targeting such as signal-sequence-mediated transport. Fluorescently labeled 
reporter molecules are usefij for determining the location, amount and chemical 
environment of the reporter. For example, whether the reporter is in a Upophilic 
membrane enviromnent or in a more aqueous emiromnent can be detennined (Giuliano 
et al. (1995), Ann. Rev. of Biophysics and Biomolecular Structure 24:405-434; Giuliano 
and Taylor (1995). Methods in Neuroscience 27:1-16). The pH enviromnent of the 
reporter can be determined (Bright et al. (1989). J. Cell Biology 104:1019-1033- 

22 



wo 00/50872 



PCT/USOO/04794 



Giuliano et al. (1987), Anal. Biochem. 167:362-371; Thomas et al. (1979), 
Biochemistry 18:2210-2218). It can be detennined whether a reporter having a 
chelating groiq) is bound to an ion, such as Ca-H-, or not OBright et al. (1989), In 
Methods in Cell Biology, Vol. 30, Taylor and Wang (eds.), pp. 157-192; Shimoura et al. 
(1988), J. of Biochemistry (Tokyo) 251:405-410; Tsien (1989) In Methods in Cell 
Biology, Vol. 30, Taylor and Wang (eds.), pp. 127-156). 

Furthermore, certain cell types within an organism may contam components 
that can be specifically labeled that may not occur in other cell types. For example, 
epithehal cells often contain polarized membrane components. That is, these cells 
asymmetrically distribute macromolecules along their plasma membrane. Connective 
or supporting tissue cells often contain granules in which are trapped molecules specific 
to that cell type (e.g., heparin, histamine, serotonin, etc.). Most muscular tissue cells 
contain a sarcoplasmic reticulum, a specialized organelle whose fimction is to regulate 
the concentration of calcium ions within the cell cytoplasm. Many nervous tissue cells 
contain secretory granules and vesicles in which are trapped neurohormones or 
neurotransmitters. Therefore, fluorescent molecules can be designed to label not only 
specific components withm specific cells, but also specific cells within a population of 
mixed cell types. 

Those skilled in the art will recognize a wide variety of ways to measure 
fluorescence. For example, some fluorescent reporter molecules exhibit a change in 
excitation or emission spectra, some exhibit resonance energy transfer where one 
fluorescent reporter loses fluorescence, while a second gains in fluorescence, some 
exhibit a loss (quenching) or appearance of fluorescence, while some report rotational 
movements (Giuliano et al. (1995), Ann. Rev. of Biophysics and Biomol. Structure 
24:405-434; Giuliano et al. (1995), Methods in Neuroscience 27:1-16). 
Scanning cell arrays 

Referring to Figure 9, a preferred embodiment is provided to analyze cells fliat 
comprises operator-directed parameters being selected based on the assay being 
conducted, data acquisition by the ceU screening system on the distribution of 
fluorescent signals within a sample, and interactive data review and analysis. At the 
start of an automated scan the operator enters information 100 that describes flie 
sample, specifies the filter settings and fluorescent channels to match the biological 
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labels being used and the infonnation sought, and then adjusts the camera settings to 
match the sample brightness. For flexibility to handle a range of samples, the software 
allows selection of various parameter settings used to identify nuclei and cytoplasm, 
and selection of difierent fluorescent reagents, identification of cells of interest based 
on morphology or brightness, and cell numbers to be analyzed per well. Hiese 
parameters are stored in the system's for easy retrieval for each automated run. TTie 
system's interactive cell identification mode simplifies the selection of morphological 
parameter limits such as the range of size, shape, and intensity of cells to be analyzed. 
The user specifies which wells of the plate the system will scan and how many fields or 
how many cells to analyze in each well. Depending on the setup mode selected by the 
user at step lOL the system either automaticaUy pre-focuses the region of the plate to 
be scanned using an autofocus procedure to "find focus" of the plate m or the user 
interactively pre-focuses 103 the scanning region by selecting three "tag" points which 
define the rectangular area to be scanned. A least-squares fit "focal plane model" is 
then calculated fix)m tiiese tag points to estimate the focus of each well during an 
automated scan. The focus of each well is estimated by inteipolating from the focal 
plane model during a scan. 

During an automated scan, the software dynamically displays the scan status, 
including tiie number of ceUs analyzed, the current well being analyzed, images of each 
independent wavelength as tfiey are acquired; and the result of die screen for each well 
as it is determined. The plate 4 (Figure 1) is scamied in a serpentine style as tiie 
software automatically moves tiie motorized microscope XY stage 1 from weU to well 
and field to field within each well of a 96-weU plate. Those skilled in tiie programming 
art will recognize how to adapt software for scamiing of otiier microplate formats such 
as 24, 48, and 384 well plates. The scan pattern of the entire plate as well as tiie scan 
pattern of fields within each well are programmed. The system adjusts sample focus 
with an autofocus procedure m (Figure 9) through flie Z axis focus drive 5. controls 
filter selection via a motorized filter wheel 19, and acquires and analyzes images of up 
to four diffCTent colors ("channels" or ' Vavelengths"). 

The autofocus procedure is called at a user selected frequency, typicaUy for the 
first field in each well and then once every 4 to 5 fields witiiin each well. The autofocus 
procedure calculates the starting Z-axis point by inten,olating from the pr^culated 
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plane focal model. Starting a programmable distance above or below this set point, the 
procedure moves the mechanical Z-axis through a number of different positions, 
acquires an image at each position, and finds the maximum of a calculated focus scoi^ 
that estimates the contrast of each image. The 2 position of the image with the 
maximum focus score determines the best focus for a particular field. Those skilled 
the art will recognize this as a variant of automatic focusing methods as described __ 
Harms et al. in Cytometry 5 (1984). 236-243, Groen et al. in Cytometry 6 (1985). 81-91, 
and Firestone et aL in Cytometry 12 (1991), 195-206. 

For image acquisition, the camera's exposure time is sq)arately adjusted for 
each dye to ensure a high-quality image fiom each chamiel. Software procedures can be 
called, at the user's option, to correct for registration shifts between wavelengths by 
accounting for linear (X and Y) shifts between wavelengths before making any further 
measurements. Hie electronic shutter 18 is controlled so that sample photo-bleaching is 
kept to a minimum. Background shading and uneven illumination can be corrected by 
the software using methods known in the art (Bright et al. (1987). J. Cell Biol. 
104:1019-1033). 

In one channel, images are acquired of a primary maricer 105 (Figure 9) 
(typically cell nuclei comiterstained with DAPI or PI fluorescent dyes) which are 
segmented ("identified") using an adaptive thresholding procedure. The adaptive 
thresholding precedure 106 is used to dynamically select the threshold of an image for 
separating cells fiom the background. The staining of cells with fluorescent dyes can 
vary to an unknown degree across ceUs in a microtiter plate sample as well as within 
images of a field of cells within each well of a microtiter plate. ITus variation can occur 
as a result of sample preparation and/or the dynamic nature of cells. A global threshold 
IS calculated for the complete image to separate the cells from background and account 
for field to field variation. These global adaptive techniques are variants of those 
described in the art (Kittler et al. in Computer Vision. Graphic, and Image 
Processing 30 (1985). 125-147. Ridler et al. in ZE££ W. Systems. Man. and 
Cybernetics (1978), 630-€32.) 

An alternative adaptive thresholding method utilizes local region thresholding 
m contrast to global image thresholdmg. Image analysis of local regions leads to better 
ovendl segmentation since staining of cell nuclei (as well as other labeled components) 
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can vaiy across an image. Using this global/local procedure, a reduced resolution 
image (reduced in size by a factor of 2 to 4) is first globally segmented (using adaptive 
thresholding) to find regions of interest in the image. These regions then serve as 
guides to more fiiUy analyze the same regions at full resolution. A more localized 
threshold is then calculated (again using adaptive thresholding) for each region of 
interest. 

The ou^ut of the segmentation procedure is a binary image wherein flie objects 
are white and the background is black. This binary image, also called a mask in the art. 
is used to determine if the field contains objects 102. The mask is labeled with a blob 
labehng method whereby each object (or blob) has a unique number assigned to it 
Morphological features, such as area and shape, of the blobs are used to differentiate 
blobs likely to be cells from those that are considered artifects. The user pre-sets the 
morphological selection criteria by either typing in known cell morphological features 
or by using the interactive training utility. If objects of interest are found in the field, 
images are acquired for aU other active channels M, otherwise the stage is advanced 
to the next field 1(22 in tiie current well. Each object of interest is located in the image 
for fimher analysis HO- TTie software determines if the object meets the criteria for a 
valid cell nucleus HI by measuring its morphological features (size and shape). For 
each valid cell, the XYZ stage location is recorded, a small image of the cell is stored, 
and features are measured 112 . 

The cell scanning method of the present invention can be used to perform many 
different assays on cellular samples by applying a number of analytical methods 
simultaneously to measure feahires at multiple wavelengths. An example of one such 
assay provides for the following measurements: 

1. The total fluorescent intensity within the ceU nucleus for colors 1-4 

2. The area of tiie cell nucleus for color 1 (the primary maricer) 

3. The shape of the cell nucleus for color 1 is described by three shape 
features: ^ 

a) perimeter squared area 

b) box area ratio 

c) height width ratio 

TTie average fluorescent intensity within the cell nucleus for colors 1-4 (i e 
#1 divided by #2) ^ 
The total fluorescent intensity of a ring outside the nucleus (see Figure 10) 
ti^reprraents fluorescence of Uie cell's cytoplasm (cytoplasmic mask) for 
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6. The area of the cytoplasmic mask 

7. The average fluorescent intensity of the cytoplasmic mask for colors 2-4 
(i.e. #5 divided by #6) 

8. The ratio of the average fluorescrait intensity of the cytoplasmic mask to 
average fluorescent intensity within the cell nucleus for colore 2-4 (i.e. #7 
divided by #4) 

9. The difference of the avoage fluorescent intensity of the cytoplasmic mask 
and the average fluorescent intensity within the cell nucleus for colors 2-4 
(i.e. #7 minus #4) 

10. The number of fluorescent domains (also call spots, dots, or grains) within 
the cell nucleus for colors 2-4 
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Features 1 through 4 are general features of the different cell screening assays 
of the invention. These stqjs are commonly used m a variety of image analysis 
applications and are well known m art (Russ (1992) The Image Processing Handbook, 
CRC Press Inc.; Gonzales et al. (1987), Digital Image Processing. Addison- Wesley 
Publishing Co. pp. 391-448). Features 5-9 have been developed specifically to provide 
measurements of a cell's fluorescent molecules within the local cytoplasmic region of 
the cell and the translocation (i.e. movement) of fluorescent molecules from the 
cytoplasm to the nucleus. These features (steps 5-9) are used for analyzing cells in 
microplates for the inhibition of nuclear translocation. For example, inhibition of 
nuclear translocation of transcription factore provides a novel ^proach to screening 
intact cells (detailed examples of other types of screens will be provided below). A 
specific method measures the amount of probe in the nuclear region (feature 4) versus 
the local cytoplasmic region (feature 7) of each cell. Quantification of the difference 
between these two sub-cellular compartments provides a measure of cytoplasm-nuclear 
translocation (feature 9). 

Feature 10 describes a screen used for counting of DNA or RNA probes within 
the nuclear region in colore 2-4. For example, probes are commerciaUy available for 
identifying chromosome-specific DNA sequences (Life Technologies, Gaithersburg, 
MD; Genosys, Woodlands, TX; Biotechnologies, Inc., Richmond, CA; Bio 101, Inc., 
Vista, CA) Cells are three-dimensional in nature and when examined at a high 
magnification under a microscope one probe may be in-focus while another may be 
completely out-of-focus. The cell screening method of the present invention provides 
for detecting three-dimensional probes in nuclei by acquiring images from multiple 
focal planes. The software moves the Z-axis motor drive 5 (Figure 1) in small steps 
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where the step distance is user selected to account for a wide range of different nuclear 
diameters. At each of the focal steps, an image is acquired. The maximum gray-level 
intensity from each pixel in each image is found and stored in a resulting maximum 
projection image. The maximum projection image is then used to count the probes. The 
above method works well in counting probes that are not stacked directly above or 
below another one. To account for probes stacked on top of each other in the Z- 
direction, users can select an option to analyze probes in each of the focal planes 
acquired. In this mode, the scanning system performs the maximum plane projection 
method as discussed above, detects probe regions of interest in this image, then further 
analyzes these regions in all the focal plane images. 

After measuring ceU features 112 (Figure 9), the system checks if there are any 
unprocessed objects in the current field 113. If there are any unprocessed. objects, it 
locates the next object HQ and determines whether it meets the criteria for a valid ceU 
nucleus m, and measures its features. Once all the objects in the current field are 
processed, the system determines whether analysis of the current plate is complete 114; 
if not, it determines the need to find more cells in the current well 115. If the need 
exists, the system advances the XYZ stage to the next field within the current well 102 
or advances the stage to the next well 116 of the plate. 

After a pl^e scan is complete, images and data can be reviewed with the 
system's image review, data review, and suibnary review facilities. All images, data, 
and settings firom a scan are archived in the system's database for later review or for 
interfacing with a network information management system. Data can also be exported 
to other third-party statistical packages to tabulate results and generate other reports. 
Users can review the images alone of every cell analyzed by the system with an 
interactive image review procedure U?. The user can review data on a cell-by-cell 
basis using a combination of interactive graphs, a data spreadsheet of measured 
features, and images of all the fluorescence channels of a ceU of interest with the 
interactive cell-by-cell data review procedure US- Graphical plotting capabilities are 
provided in which data can be analyzed via interactive graphs such as histograms and 
scatter plots. Users can review summary data that are accumulated and summarized for 
all cells within each well of a plate with an interactive weU-by-well data review 
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procedure 112. Hard copies of graphs and images can be printed on a wide range of 
standard printers. 

As a final phase of a complete scan, reports can be generated on one or more 
statistics of the measured features. Users can generate a graphical report of data 
summarized on a well-by-weU basis for the scanned region of the plate using an 
interactive report generation procedure 120. This report includes a summary of the 
statistics by well in tabular and graphical format and identification information on the 
sample. The report window allows the opaator to enter comments about the scan for 
later retrieval. Multiple reports can be generated on many statistics and be printed with 
the touch of one button. Reports can be previewed for placement and data before being 
printed. 

The above-recited embodiment of the method operates in a single high 
resolution mode referred to as the high content screening (HCS) mode. The HCS mode 
provides sufficient spatial resolution within a weU (on the order of 1 jmi) to define the 
distribution of material withm the well, as weU as within individual cells in the well. 
The high degree of information content accessible in that mode, comes at the expense 
of speed and complexity of the required signal processing. 

In an alternative embodiment, a high throughput system (HTS) is direcfly 
coupled with the HCS either on the same platform or on two separate platforms 
connected electronically (e.g. via a local ai«a network). This embodiment of the 
invention, referred to as a dual mode optical system, has the advantage of increasing the 
throughput of an HCS by coupling it with an HTS and thereby requiring slower high 
resolution data acquisition and analysis only on the small subset of wells that show a 
response in the coupled HTS. 

High throughput 'whole plate' reader systems are well known in the art and are 
commonly used as a component of an HTS systan used to screen large nmnbers of 
compounds (Beggs et al. (1997), supra; McCaffiey et al. (1996), supra ). The HTS of 
the present invention is carried out on the microtiter plate or microweU array by reading 
many or all wells m the plate simultaneously with sufiBcient resolution to make 
determinations on a well-by-well basis. That is. calculations are made by averaging the 
total signal output of many or aU the cells or the bulk of the material in each weU. 
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Wells that exhibit some defined response in the HTS (the 'hits') are flagged by the 
system. Then on the same microtiter plate or microwell array, each well identified as a 
hit is measured via HCS as described above. Thus, the dual mode process involves: 

1. Rapidly measuring numerous wells ofa microtiter plate or microwell anay, 

2. hiterpreting the data to determine the overall activity of fluorescently labeled 
reporter molecules in the ceUs on a well-by-well basis to identify "hits" fwells that 
exhibit a defined response), v "loi 

3. Imaging numerous cells in each "hit" well, and 

4. Interpreting the digital image data to determine the distribution, envittmment or 
activity of the fluorescently labeled reporter molecules in the individual cells fi e 
intracellular measurements) and the distribution of the cells to test for specific 
biological fimctions *^ 

In a preferred embodiment of dual mode processing (Figure 1 1), at the start of a 
run 20L the operator enters information m that describes the plate and its contents, 
specifies the filter settings and fluorescent channels to match the biological labels being 
used, the information sought and the camera settings to match the sample brightness. 
These parameters are stored in the system's database for easy retrieval for each 
automated nm. The microtiter plate or microwell array is loaded into the cell screening 
system 302 either manually or automatically by controlling a robotic loading device. 
An optional environmental chamber 304 is controUed by the system to maintain the 
temperature, humidity and CCh levels in the air surrounding live cells in the microtiter 
plate or microwell array. An optional fluid delivery device 305 (see Figure 8) is 
controlled by the system to dispense fluids into the weUs during the scan. 

High throughput processing 306 is first performed on the microtiter plate or 
microwell array by acquiring and analyzing the signal fiom each of the wells in the 
plate. The processing performed in high throughput mode 302 is illustrated in Figure 12 
and described below. Wells that exhibit some selected intensity response in diis high 
throughput mode ("hits") are identified by the system. The system performs a 
conditional operation that tests for hits. If hits are found, those specific hit wells are 
fiirther analyzed in high content (micro level) mode m The processing performed in 
high content mode HI is illustrated in Figure 13. The system then updates m the 
informatics database Mi with results of the measurements on the plate. If there are 
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more plates to be analyzed 313 the system loads the next plate 303; otherwise the 
analysis of the plates terminates 314. 

The foUowing discussion describes the high throughput mode iUustrated in 
Figure 12. The prefeired embodiment of the system, the single platform dual mode 
screening system, will be described. Those skilled in the art will recognize that 
operationally the dual platform system simply involves moving the plate between two 
optical systems rather than moving the optics. Once the system has been set up and the 
plate loaded, the system begins the HTS acquisition and analysis 401. The HTS optical 
module is selected by controlling a motorized optical positioning device 402 on the 
dual mode system. In one fluorescence channel, data from a primary marker on the 
plate is acquired 403 and wells are isolated from the plate backgromid using a masking 
procedure 404. Images are also acquired in other fluorescence channels being used 405. 
The region in each image corresponding to each well 406 is measured 402. A featuiB 
calculated from the measurements for a particular well is compared with a predefined 
threshold or intensity response m and based on the result the well is either flagged as 
a 'Tiif 4^ or not. The locations of the wells flagged as hits are recorded for 
subsequent high content mode processing. If there are wells remaining to be processed 
410 the program l6ops back 4Q6 untU all the wells have been processed 4U and the 
system exits high throughput mode. 

Following HTS analysis, the system starts the high content mode processing 
501 defined in Figure 13. The system selects the HCS optical module m by 
controlling the motorized positioning system. For each •'hit" well identified in high 
throughput mode, the XY stage location of the weU is retrieved from memory or disk 
and the stage is then moved to the selected stage location m The autofocus procedure 
504 is called for the first field in each hit well and then once every 5 to 8 fields within 
each well. In one channel, images are acquired of the primary marker SQS. (lypicaUy 
cell nuclei counterstained with DAPI. Hoechst or PI fluorescent dye). The images are 
then segmented (separated into regions of nuclei and non-nuclei) using an adaptive 
thresholding procedure m The output of the segmentation procedure is a binary mask 
wherein the objects are white and the backgromid is black. This binary image, also 
called a mask in the art, is used to determine if the field contains objects m The mask 

31 



wo 00/50872 



PCT/US00/O47M 



is labeled with a blob labeling method whereby each object (or blob) has a unique 
number assigned to it If objects are found in the field, images are acquired for aU other 
active channels Sfig. otherwise the stage is advanced to the next field 514 in the current 
well. Each object is located in the image for further analysis 509. Moiphological 
features, such as area and sh^e of the objects, are used to select objects Ukely to be 
cell nuclei m and discard (do no further processing on) those that are considered 
artifacts. For each valid cell nucleus, the XYZ stage location is recorded, a small image 
of the cell is stored, and assay specific features are measured 511. The system then 
performs multiple tests on the cells by applying several analytical methods to measure 
features at each of several wavelengths. After measuring the cell features, the systems 
checks if there are any unprocessed objects in the current field 512. If there are any 
unprocessed objects, it locates the next object 509 and determines whether it meets the 
criteria for a valid cell nucleus 510, and measures its features. After processing all the 
objects in the current field, the system deteremines whether it needs to find more cells 
or fields in the current well 513- If it needs to find more cells or fields in the current 
weU it advances the XYZ stage to the next field within the current well 515 . 
Otherwise, the system checks whether it has any remaining hit wells to measure 511. If 
so, it advances to the next hit well 501 and proceeds through another cycle of 
acquisition and analysis, otherwise the HCS mode is finished 516. 

In an alternative embodiment of the present invention, a method of kinetic live 
cell screening is provided. The previously described embodhnentsofthe invention are 
used to characterize the spatial distribution of cellular components at a specific point in 
time, the time of chemical fixation. As such, these embodiments have limited utility 
for unplementing kinetic based screens, due to the sequential nature of the image 
acquisition, and the amount of time required to read all the wells on a plate. For 
example, since a plate can require 30 - 60 minutes to read through all the wells, only 
very slow kinetic processes can be measured by simply preparing a plate of live cells 
and then reading through all the wells more than once. Faster kinetic processes can be 
measured by taking multiple readings of each weU before proceeding to the next well, 
but the elapsed time between the first and last well would be too long, and fast kinetic 
processes would likely be conq)lete before reaching the last weU. 
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The kinetic live cell extension of the invention enables the design and use of 
screens in which a biological process is characterized by its kinetics instead of; or in 
addition to. its spatial characteristics. In many cases, a response in live cells can be 
measured by adding a reagent to a specific well and making multiple measurements on 
that well with the appropriate timing. This dynamic live ceU embodiment of the 
invention therefore includes apparatus for fluid delivery to individual wells of the 
system in order to deUver reagents to each well at a specific time in advance of reading 
the well. This embodiment thereby allows kinetic measurements to be made with 
temporal resolution of seconds to minutes on each weU of the plate. To improve the 
overall efficiency of the dynamic live cell system, the acquisition control program is 
modified to allow repetitive data collection fiom sub-regions of the plate, allowing the 
system to read other wells between the time points required for an individual well. 

Figure 8 describes an example of a fluid delivery device for use with the live 
cell embodiment of the invention and is described above. This set-up allows one set of 
pipette tips m or even a single pipette tip. to deliver reagent to all the weUs on the 
plate. The bank of syringe punqjs 701 can be used to deliver fluid to 12 wells 
simultaneously, or to fewer wells by removing some of the tips TQS. The temporal 
resolution of the system can therefore be adjusted, without sacrificing data collection 
efficiency, by changing the number of tips and the scan pattern as follows. Typically, 
the data collection and analysis fix)m a single well takes about 5 seconds. Moving fiom 
well to well and focusing in a well requires about 5 seconds, so the overaU cycle time 
for a well is about 10 seconds. Therefore, if a single pipette tip is used to deUver fluid 
to a single weU. and data is collected repetitively fiom that well, measurements can be 
made widi about 5 seconds temporal resolution. If 6 pipette tips ai« used to deliver 
fluids to 6 wells simultaneously, and the system repetitively scans all 6 wells, each scan 
will require 60 seconds, thereby estabUshing the temporal resolution. For slower 
processes which only require data collection every 8 minutes, fluids can be delivered to 
one half of the plate, by moving the plate during the fluid deliveiy phase, and flien 
repetitively scamiing that half of tiie plate. Therefore, by adjusting the size of the sub- 
region being scaraied on the plate, the temporal resolution can be adjusted without 
having to insert wait times between acquisitions. Because the system is continuously 
scanning and acquiring data, die overall time to coUect a kinetic data set fiom the plate 
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is then simply the time to perfonn a single scan of the plate. multipUed by the number 
of time points required. Typically. 1 time point before addition of compounds and 2 or 
3 time points following addition should be sufficient for screening purposes. 

Figure 14 shows the acquisition sequence used for kinetic analysis. The start of 
processing 801 is configuration of the system, much of which is identical to the 
standard HCS configuration. In addition, the operator must enter infonnation specific 
to the kinetic analysis being performed 802, such as die sub-region size, the number of 
time points required, and the required time increment A sub-region is a group of wells 
that will be scanned repetitively in order to accumulate kinetic data. The size of the 
sub-region is adjusted so that the system can scan a whole sub-region once during a 
single time increment, thus minimizing wait times. The optimum sub-region size is 
calculated torn the setup parameters, and adjusted if necessary by the operator. The 
system then moves the plate to the first sub-region 803, and to the first well in that sub- 
region 804 to acquire the prestimulation (time = 0) time points. The acquisition 
sequence perfomied in each well is exactly the same as that required for the specific 
HCS being run in kinetic mode. Figure 15 details a flow chart for that processing. All 
of the steps between the start 901 and the return m are identical to those described as 
stqjs 504 - 514 in Figure 13. 

After processing each well in a sub-region, the system checks to see if all the 
wells in the sub-region have been processed m (Figure 14). and cycles through all the 
wells until the whole region has been processed. The system then moves the plate into 
position for fluid addition, and controls fluidic system deliveiy of fluids to the entire 
sub-region 802. This may require multiple additions for sub-regions which span 
several rows on the plate, with the system moving the plate on the X,Y stage between 
additions. Once the fluids have been added, the system moves to the fir^ weU in the 
sub-region 8Q8 to begin acquisition of time points. The data is acquired ftom each weU 
8Q9 and as before the system cycles through all the wells in the sub-region 810. After 
each pass through the sub-region, the system checks whether aU the time points have 
been collected m and if not, pauses SB if necessary 812 to stay synchronized with the 
requested time increment Otherwise, the system checks for additional sub-regions on 
the plate SH and either moves to the next sub-region 803 or finishes 815. Thus, the 
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kinetic analysis mode comprises operator identification of sub-regions of the microtiter 
plate or microwells to be screened, based on the kinetic response to be investigated, 
with data acquisitions within a sub-region prior to data acquisition in subsequent sub- 
regions. 

Specific Screens 

In another aspect of the present invention, cell screening methods and machine 
readable storage medium comprising a program containing a set of instructions for 
causing a cell screening system to execute procedures for defining the distribution and 
activity of specific cellular constituents and processes is provided. In a preferred 
embodiment, the cell screening system comprises a high magnification fluorescence 
optical system with a stage adapted for holding cells and a means for moving the stage, 
a digital camera, a light source for receiving and processing the digital data from thJ 
digital camera, and a computer means for receiving and processing the digital data from 
the digital camera. This aspect of the invention comprises programs that instruct the 
cell screening system to define the distribution and activity of specific cellular 
constituents and processes, usmg the luminescent probes, the optical unaging system, 
and the pattern recognition software of the invention. Preferred embodhnents of the' 
machine readable storage medium comprise programs consisting of a set of instructions 
for causing a cell screening system to execute the procedures set forth in Figures 9, 1 1, 
12, 13. 14or 15. Anodier preferred embodiment comprises a program consisting of i 
set of instructions for causing a cell screening system to execute procedures for 
detecting the distribution and activity of specific cellular constituents and processes. In 
most preferred embodiments, tiie cellular processes include, but are not limited to. 
nuclear translocation of a protein, cellular morphology, apoptosis. receptor 
internalization, and protease-induced translocation of a protein. 

In a preferred embodiment, the cell screening methods are used to identify 
compounds that modify the various ceUular processes. The cells can be contacted with 
a test compound, and the effect of the test compound on a particular ceUular process 
can be analyzed. Alternatively, the cells can be contacted with a test compound and a 
known agent that modifies the particular cellular process, to determine whether the test 
compound can inhibit or enhance the effect of the known agent. Thus, the methods can 
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be used to identify test compounds that inc^e or decease a particular ceUular 

response, as weU as to identify test confounds that affects the ability of other 
mcrease or decrease a particular cellular response. 

In another preferred embodiment, the locations containing cells are analyzed 
5 usmg the above methods at low resolution in a high throughput mode, and only a subset 
of the locations containing cells are analyzed in a high content mode to obtam 
lummescent signals fiom the luminescently labeled reporter molecules in subceUular 
compartments of the cells being analyzed. 

The foUowing examples are intended for purposes of illustration only and 
. should not be construed to limit the scope of the invention, as defined in the claims 
appended hereto. 

•me various chemical compounds, reagents, dyes, and antibodies that ai^ 
referred to in the following Examples a.. commen:ially available fiom such sources as 
Sigma Chemical (St Louis. MO). Molecular Probes (Eugene, OR). Aldrich Chemical 
Company (Milwaukee. WI). Accurate Chemical Company (Westbury. NY). Jackson 
Immunolabs, and Qontech (Palo Alto, CA). 

Example I Cytoplasm to Nucleus Translocation Screening: 

a- Transcription Factors 

Regulation of transcription of some genes mvolves activation of a transcription 
factor in the cytoplasm, resulting in that factor being transported into the nucleus wher. 
it mitiate transcription of a particular gene or genes, ^s change in transcription 
factor d.stnbution is the basis of a screen for the cell-based scr^g system to detect 
compounds that inhibit or induce ^cription of a particular gene or group of genes 
A general description of the screen is given foUowed by a specific example 

The distribution of the transcription factor is determined by labeling the nuclei 
wrth a DNA specific fluorophore like Hoechst 33423 and the transcription fector with a 
specific fluorescent antibody. After autofocusing on the Hoechst labeled nuclei an 
rrrmge of the nuclei is acquired in the cell-based screening system and used to cr^ a 
mask by one of several optional thresholding methods, as described supra Tto 
morphological descriptors of the regions defined by the mask are compared with the 
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user defined parameten; and valid nuclear masks are identified and used with the 
following method to extract transcription factor distributions. Each valid nuclear mask 
is eroded to define a slightly smaller nuclear region. The original nuclear mask is th«, 
dilated in two steps to define a ring shaped region around tiie nucleus, which represents 
a cytoplasmic region. The average antibody fluorescence in each of tiiese two regions 
is detennined, and the difference between these averages is defined as the NucCyt 
Difference. Two examples of determining nuclear translocation are discussed below 
and illustrated in Figure lOA^. Figure lOA iUustiates an unstimulated cell with its 
nucleus 200 labeled witi, a blue fluorophore and a tianscription factor m the cytoplasm 
201 labeled with a green fluorophore. Figure lOB illustiates tiie nuclear mask 202 
derived by the cell-based screening system. Figure IOC illustiates tiie cytoplasm 203 
of the unstimulated cell imaged at a green wavelengtii. Figure lOD illustrates the 
nuclear mask m is eroded (reduced) once to define a nuclear sampling region 204 
witii minimal cytoplasmic distoibution. TTie nucleus boundary 202 is dilated (expanded) 
several times to fonn a ring tiiat is 2-3 pixels wide that is used to define the 
cytoplasmic sampling region 205 for tiie same cell. Figure lOE fiirther illustiates a side 
view which shows the nuclear sampling region 204 and tiie cytoplasmic sampling 
region 205. Using tiiese two sampling regions, datii on nuclear translocation can be 
automatically analyzed by tiie cell-based screening system on a cell by cell basis. 
Figure lOF-J illustrates tiie strategy for ' determining nuclear translocation in a 
stimulated cell. Figure I OF illustrates a stimulated cell witii its nucleus 206 labeled wifli 
a blue fluorophore and a transcription fector in tiie cytoplasm 207 labeled witii a green 
fluorophore. The nuclear mask m in Figure lOG is derived by tiie cell based 
screening system. Figure lOH illustrates die cytoplasm 2Q9 of a stimulated ceU imaged 
at a green wavelengtii. Figure 101 illustrates flie nuclear sampling region 2U and 
cytoplasmic sampling region 212 of tiie stimulated cell. Figure lOJ fimher illustrates a 
side view which shows flie nuclear sampUng region 2n and tiie cytoplasmic sampling 
region 212. 

A specific application of tiiis metiiod has been used to validate tiiis metiiod as a 
screen. A human cell line was plated in 96 weU microtiter plates. Some rows of wells 
were titrated witii IL-l. a known inducer of tiie NF-KB transcription factor. The ceUs 
were tiien fixed and stained by standard mefliods witii a fluorescein labeled antibody to 
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the tianscription factor, and Hoechst 33423. Tlie cell-based screening system was used 
to acquire and analyze images fiom this plate and the NucCyt Difference was found to 
be strongly correlated with the amount of agonist added to the wells as illustrated in 
Figure 16. In a second experiment, an antagonist to the receptor for IL-1, EL-lRA was 
titrated in the presence of IL-la, progressively inhibiting the translocation induced by 
IL-la. The NucCyt Difference was found to strongly correlate with this inhibition of 
translocation, as illustrated in Figure 17. 

Additional experiments have shown that the NucCyt Difference, as well as the 
NucCyt ratio, gives consistent results over a wide range of cell densities and reagent 
concentrations, and can therefore be routinely used to screen compound Ubraries for 
specific nuclear translocation activity. Furthemiore, the same method can be used with 
antibodies to other transcription factors, or GFP-transcription fector chimeras, or 
fluorescently labeled transcription factors introduced into living or fixed cells, to screen 
for effects on the regulation of transcription factor activity. 

Figure 18 is a representative display on a PC screen of data which was obtained 
in accordance with Example 1. Graph 1 180 plots the difference between the average 
antibody fluorescence in the nuclear sampling region and cytoplasmic sampling region, 
NucCyt Difference verses WeU #. Graph 2 iSl plots the average fluorescence of the 
antibody in the nuclear sampling region. NPl average, versus the Well #. Graph 3 182 
plots the average antibody fluorescence in the cytoplasmic sampling region, LIPl 
average, versus Well #. The software permits displaying data fiom each cell. For 
example. Figure 18 shows a screen display 183, the nuclear image 184. and the 
fluorescent antibody image ig^ for cell #26. 

NucCyt Difference referred to in graph 1 180 of Figure 18 is the difference 
between the average cytoplasmic probe (fluorescent reporter molecule) intensity and 
the average nuclear probe (fluorescent reporter molecule) intensity. NPl average 
referred to in graph 2 m of Figure 18 is the average of cytoplasmic probe (fluorescent 
reporter molecule) intensity within the nuclear sampUng region. LIPl average referred 
to in graph 3 182 of Figure 18 is the average probe (fluorescent reporter molecule) 
intaisity within the cytoplasmic sampUng region. 

It will be understood by one of skill in the art that this aspect of the invention 
can be perfonned using other transcription factors that translocate from the cytoplasm 
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to the nucleus upon activation. In another specific example, activation of the c-fos 
transcription factor was assessed by defining its spatial position within cells. Activated 
c-fos is found only within the nucleus, while inactivated c-fos resides within the 
cytoplasm. 

3T3 cells were plated at 5000-10000 cells per well in a Polyfiltronics 96-weU 
plate, nie cells were allowed to attach and grow overnight The cells were rinsed 
twice with 100 Ml serum-free medium, incubated for 24-30 hours in serum-free MEM 
culture medium, and then stimulated with platelet derived growth factor (PDGF-BB) 
(Sigma Chemical Co., St Louis, MO) diluted directly into serum fiw medium at 
concentrations ranging from 1-50 ng/ml for an average time of 20 minutes. 

Following stimulation, cells were fixed for 20 minutes in 3.7% formaldehyde 
solution in IX Hanks buffered saline solution (HBSS). After fixation, the cells were 
washed with HBSS to remove residual fixative, permeabilized for 90 seconds with 
0.5% Triton X-100 solution in HBSS. and washed twice with HBSS to remove residual 
detergent The cells were then blocked for 15 minutes with a 0.1% solution of BSA in 
HBSS. and further washed with HBSS prior to addition of diluted primary antibody 
solution. 

c-Fos rabbit polyclonal antibody (Calbiochem, PC05) was diluted 1:50 in 
HBSS, and 50 jil of the dilution was applied to each well. CeUs were incubated in the 
presence of primary antibody for one hour at reom temperature, and then incubated for 
one hour at room temperature in a light tight container with goat anti-rabbit secondary 
antibody conjugated to ALEXA™ 488 (Molecular Probes), diluted 1:500 fiom a 100 
ng/ml stock in HBSS. Hoechst DNA dye (Molecular Probes) was then added at a 
1:1000 dilution of the manufacturer's stock solution (10 mg/ml). The cells were then 
washed with HBSS. and the plate was sealed prior to analysis with the ceU screening 
system of the mvention. TTie data from these experiments demonstrated that the 
methods of the invention could be used to measure transcriptional activation of c-fos by 
defining its spatial position within cells. 

One of skill in the art will recognize that while the following method is applied to 
detection of c-fos activation, it can be appUed to the analysis of any transcription factor 
that translocates from the cytoplasm to the nucleus upon activation. Examples of such 
transcription factors include, but are not limited to fos and jun homologs. NF-KB 
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(nuclear factor kappa fiom B ceUs), NFAT (nuclear factor of activated T-lymphocytes), 
and STATs (signal transducer and activator of transcription) factors (For example, see 
Strehlow, I., and Schindler, C. 1998. J. Biol. Chem. 273:28049-28056; Chow, et al. 
1997 Science. 278:1638-1641; Ding et al. 1998 J. Biol Chem. 273:28897-28905; 
Baldwin, 1996. Annu Rev Immunol. 14:649-83; Kuo, C.T., and J.M. Leidea 1999. 
Annu Rev Immunol. 17:149-87; Rao, et al. 1997. Annu Rev Immunol. 15:707-47; 
Masuda,et al. 1998. Cell Signal. 10:599-611; Hoey, T., and U. Schindler. 1998. Curr 
Opin Genet Dev. 8:582-7; Liu. et aL 1998. Curr Opin Immunol. 10:271-8.) 

Thus, in this aspect of the invention, indicator cells are treated with test 
compounds and the distribution of luminescently labeled transcription factor is 
measured in space and time using a cell screening system, such as the one disclosed 
above. The luminescently labeled transcription factor may be expressed by or added to 
the cells either before, togeflier with, or after contacting the cells widi a test compound. 

For example, the transcription factor may be expressed as a luminescently 
labeled protein chimera by transfected indicator cells. Alternatively, the luminescentiy 
labeled transcription factor may be expressed, isolated, and bulk-loaded into the 
indicator cells as described above, or the transcription factor may be luminescentiy 
labeled after isolation. As a further alternative, the transcription factor is expressed by 
the indicator cell, which is subsequently contacted with a luminescent label, such as an 
antibody, that detects the transcription fector. 

In a further aspect, kits are provided for analyzing transcription factor activation, 
comprising an antibody that specifically recognizes a transcription fector of interest, 
and instructions for using the antibody for carrying out die methods described above. 
In a preferred embodiment, the transcription fector-specific antibody, or a secondary 
antibody that detects the transcription factor antibody, is luminescentiy labeled. In 
further preferred embodiments, flie kit contains cells fliat express tiie transcription 
factor of interest, and/or tiie kit contains a compound that is known to modify activation 
of tiie tianscription factor of interest, including but not limited to platelet derived 
growtii factor (PDGF) and serum, which botii modify fos activation; and interleukin 
lOL-l) and tumor necrosis factor (TNF), which botii modify NF-KB activation. 

In anotiier embodiment, tiie kit comprises a. recombinant expression vector 
comprising a nucleic acid encoding a ti-anscription factor of interest tiiat translocates 
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from the cytoplasm to the nucleus upon activation, and instructions for using the 
expression vector to identify conipounds that modify transcription factor activation in a 
cell of interest Alternatively, the kits contain a purified, luminescently labeled 
transcnption factor. In a preferred embodiment, the transcription factor is expressed as 
a fusion protein with a luminescent protein, including but not Umited to green 
fluorescent protein, luceriferase, or mutants or fragments diereof. In various preferred 
embodiments, the kit fimher contains cells that are transfected with the expression 
vector, an antibody or fragment that specifically bind to the transcription factor of 
mterest. and/or a compound that is known to modify activation of the transcription 
factor of interest (as above). 

b. Protein Kinases 

The cytoplasm to nucleus screening methods can also be used to analyze the 
activation of any protein kinase that is present in an inactive state in the cytoplasm and 
.s transported to the nucleus upon activation, or that phosphorylates a substrate that 
translocates from the cytoplasm to the nucleus upon phosphorylation. Examples of 
appropriate protein kinases include, but are not limited to extraceUular signal-regulated 
protein kinases (ERKs). c-Jun amino-terminal kinases (JNKs). Fos regulating protein 
kinases (FRKs). p38 mitogen activated protein kinase (p38MAPK). protein kinase A 
(PKA). and mitogen activated protein kinase kinases (MAPKKs). (For example see 
Hall, et al. 1999. J Biol Chem. nA-31^Z3; Han. et al. 1995. Biochim. Biophys. \cta 
1265:224-227; Jaaro et al. 1997. Proc. Natl Acad. ScL U.S.A. 94:3742-3747; Taylor et 
al. 1994. J. Biol. Chem. 269:308-318; Zhao. Q.. and F. S. Lee. 1999. J Biol oLm 
274:8355-8; PaoliUoet al. im. J Biol Chem. 274:6546-52; Coso et al. 1995. Cell 
81:1137-1146; Tibbies. L.A.. and J.R. Woodgett 1999. Cell Mol Life Sci. 55:1230-54; 
Schaeffer, H.J.. and M.J. Weber. 1999. Mol Cell Biol. 19:2435-44.) 

Alternatively, protein kinase activity is assayed by monitoring translocation of a 
luminescenUy labeled protein kinase substrate from the cytoplasm to the nucleus after 
being phosphorylated by the protein kinase of interest In this embodiment, the 
substrate is non-phosphorylated and cytoplasmic prior to phosphorylation, and is 
translocated to the nucleus upon phosphorylation by the protein kinase. TTiere is no 
requirement that the protein kinase itself translocates from the cytoplasm to the nucleus 
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in this embodiment. Examples of such substrates (and the coixesponding protein 
kmase) include, but are not limited to cjun (JNK substrate); fos (FRK substrate) and 
p38 (p38 MAPK substrate). 

Thus, in these embodiments, indicator cells are treated with test compounds and 
the distribution of luminescently labeled protein kinase or protein kinase substrate is 
measured in space and time using a ceU screening system, such as the one disclosed 
above. The Imninescently labeled protein kinase or protein kinase substrate may be 
expressed by or added to the cells either before, together with, or after contacting the 
cells with a test compound. For example, the protein kinase or protein kinase substrate 
may be expressed as a luminescenUy labeled protein chimera by tiansfected indicator 
cells. Alternatively, the luminescently labeled protein kinase or protein kinase 
substrate may be expressed, isolated, and bulk-loaded into the indicator ceUs as 
described above, or the protein kinase or protein kinase substrate may be luminescently 
labeled after isolation. As a fimher alternative, the protein kinase or protein kinase 
15 substrate is expressed by the indicator cell, which is subsequenfly contacted with a 
luminescent label, such as a labeled antibody, that detects the protein kinase or protein 
kinase substrate. 

In a fiirther embodiment, protein kinase activity is assayed by monitoring the 
phosphorylation state (ie: phosphorylated or not phosphoiylated) of a protein kinase 

20 substrate. In this embodiment, there is no requirement that either the protein kinase or 
the protein kinase substrate translocate from the cytoplasm to the nucleus upon 
activation, m a preferred embodiment, phosphorylation state is monitored by 
contacting the cells with an antibody that binds only to the phosphorylated form of the 
protein kinase substrate of interest (For example, as disclosed in U.S. Patent No 

25 5,599,681). 

In another preferred embodiment, a biosensor of phosphorylation is used. For 
example, a luminescently labeled protein or fragment thereof can be fused to a protein 
that has been engineered to contain (a) a phosphorylation site that is recognized by a 
protein kinase of interest; and (b) a nuclear localization signal that is umnasked by the 
30 phosphorylation. Such a biosensor will thus be translocated to the nucleus upon 
phosphorylation, and its translocation can be used as a measure of protein kinase 
activation. 
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In another aspect, kits are provided for analyzing protein kinase activation, 
comprising a primary antibody that specificaUy binds to a protein kinase, a protein 
kinase substrate, or a phosphoiylated form of the protein kinase substrate of interest and 
instructions for using the primary antibody to identify compounds that modify protein 
kinase activation in a cell of interest In a preferred embodiment, die primary antibody, 
or a secondary antibody that detects the primary antibody, is luminescentiy labeled. Iii 
other preferred embodiments, the kit further comprises cells that express the protein 
kinase of interest, and/or a compound that is known to modify ^tivation of the protein 
kinase of interest, including but not limited to dibutyryl cAMP (modifies PKA), 
foTskolin (PKA), and anisomycin (p38MAPK). 

Alternatively, the kits comprise an expression vector encoding a protein kinase 
or a protein kinase substrate of interest that translocates fiom the cytoplasm to the 
nucleus upon activation and instructions for using the expression vector to identify 
compounds that modify protein kinase activation in a cell of interest Alternatively, the 
kits contain a purified, luminescentiy labeled protein kinase or protein kinase substrate. 
In a prefeired embodiment the protein kinase or protein kinase substrate of interest is 
expressed as a fusion protein witii a luminescent protein. In further prefeired 
embodunents, the kit further comprises ceUs fliat are transfected witii the expression 
vector, an antibody or fragment fliereof tiiat specifically binds to the protein kinase or 
protein kinase substrate of interest and/or a compound that is known to modify 
activation of the protein kinase of interest, (as above) 

In anoUier aspect tiie present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a ceU 
screening system to execute tiie methods disclosed for analyzing transcription fector or 
protein kinase activation, wherein the cell screening system comprises an optical 
system witii a stage adapted for holding a plate containing cells, a digital camera, a 
means for directing fluorescence or luminescence emitted from tiie cells to the digital 
camera, and a computer means for receiving and processing tfie digital data fiom the 
digital camera. 



43 



wo 00/50872 

PCT/USOO/04794 

B^leZ ^"tomated Screen for Compounds that Modify CeUularM 

Changes in cell size aie associated with a number of ceUular conditions, such as 
hypertrophy, cell attachment and spreading, differentiation, growth and division, 
necn^tic and programmed cell death. ceU motility, morphogenesis, tube formation, and 

colony formation. 

For example, cellular hypertrophy has been associated with a cascade of 
alterations in gene expression and can be characterized in cell culture by an alteration in 
cell size, that is clearly visible in adherent cells growing on a coverslip. 

Cell size can also be measured to determine the attachment and spi^ming of 
adherent cells. CeU spreading is the result of selective binding of cell surface receptors 
to substrate ligands and subsequent activation of signaling pathways to the 
cytoskeleton. Cell attachment and spreading to substrate molecules is an important step 
for the metastasis of cancer cells, leukocyte activation during the inflammatory 
response, keratinocyte movement during wound healing, and endothelial cell 
movement during angiogenesis. Compounds that affect these surfece receptors, 
signaling pathways, or the cytoskeleton will affect cell spreading and can be screened 
by measuring cell size. 

Total ceUular area can be monitored by labeling the entire cell body or the cell 
cytoplasm using cytoskeletal markers, cytosolic volume mariceis. or cell surface 
maricers. in conjunction with a DNA label. Examples of such labels (many available 
from Molecular Probes (Eugene. Oregon) and Sigma Chemical Co. (St. Louis. 
Missouri)) include the following: 
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CELL SIZE AND AREA MARKERS 
Cytoskeletol Markere 



ALEXA 488 phalloi din (Molecular Probes. Qrc frnn) 



Tubulin-g reen fluorescent protein chimeras 
Cvtokeratin-green fluorescent protein chimeras 



Antibodies to cvtoskeletal proteins 



Cytosoiic Volume Markers 



• Green fluorescent proteins 



Chlorometiivlfluorescein diacetatc (CMFDA) 
* Calccin green 



BCECF/AM ester 
Rhodamine dcxtrah 



CeU Surface Markers for Lipid, Protein, or Oligosaccharidg 



Dihexadecyl tetramethylindocarfaocvani ne perehlorate (DnC16) lipid dyes 

- — TnethYlammonium propyl dibutvlamino styryl pvridinium (m4-64. FM 1-41^ linirf Hv^. 
♦ MITOTRACKF.T?™ Green FM — 



* "''gosaccarides such as fluorescein r p ncanavalin A nr wheat germ a^a hititiin 
S YPRO Red non-specific protein markers 
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Anfa-bodies to various surfac e proteins such as epidermal piowth factor 



B iotin lab elinR of surface proteins followed by fluorescent strepavidin labeleinp 



Protocols for cell staining with these various agents are well known to those 
skiUed in the art. Cells are stained live or after fixation and the cell area can be 
measured. For example, live cells stained with DiIC16 have homogeneously labeled 
plasma membranes, and the projected cross-sectional area of the cell is uniformly 
discriminated from background by fluorescence intensity of the dye. Live cells stained 
with cytosoiic stains such as CMFDA produce a fluorescence intensity that is 
proportional to cell thickness. Although cell labeling is dumner in thin regions of the 
cell, total ceU area can be discriminated from background. Fixed cells can be stained 
with cytoskeletal markers such as ALEXA™ 488 phaUoidin that label polymerized 
actin. Phalloidin does not homogeneously stain the cytoplasm, but still pennits 
discrimination of the total cell area torn background. 

15 Cellular hypertrophy 

A screen to analyze cellular hypertrophy is implemented using the foUowing 
strategy. Primary rat myocytes can be cultured in 96 well plates, treated with various 
compounds and then fixed and labeled with a fluorescent marker for the cell membrane 
or cytoplasm, or cytoskeleton, such as an antibody to a ceU surface marker or a 
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fluorescent marker for the cytoskeleton like rhodamme-phalloidin, in combination with 
a DNA label Uke Hoechst 

After focusing on the Hoechst labeled nuclei, two unages are acquired, one of 
the Hoechst labeled nuclei and one of the fluorescent cytoplasm image. TTie nuclei are 
identified by thresholding to create a mask and then comparing the morphological 
descriptors of the mask with a set of user defined descriptor values. Each non-nucleus 
image (or "cytoplasmic image") is then processed separately. The original cytoplasm 
image can be Ihresholded. creating a cytoplasmic mask image. Local regions containing 
cells are defined around the nuclei. The limits of the cells in diose regions are then 
defined by a local dynamic threshold operation on the same i^on in the fluorescent 
antibody image. A sequence of erosions and dilations is used to sq>arate slightly 
touching cells and a second set of morphological descriptors is used to identify single 
cells. The area of the individual cells is tabulated in order to define the distribution of 
cell sizes for comparison with size data fiom normal and hypertrophic cells. 

Responses fiom entire 96-well plates (measured as average cytoplasmic 
area/cell) were analyzed by the above methods, and the results demonstrated that the 
assay will perform the same on a well-to-well, plate-to-plate, and day-to-day basis 
(below a 15% cov for maximum signal). The data showed very good correlation for 
each day, and tiiat there was no variability due to well position in the plate. 

The following totals can be computed for the field. The aggregate whole 
nucleus area is the number of nonzero pixels in the nuclear mask. The average whole 
nucleus area is the aggregate whole nucleus area divided by the total nmnber of nuclei. 
For each cytoplasm image several values can be computed. These are the total 
cytoplasmic area, which is the count of nonzero pixels in the cytoplasmic mask. The 
aggregate cytoplasm intensity is the sum of the intensities of aU pixels in the 
cytoplasmic masL The cytoplasmic area per nucleus is the total cytoplasmic area 
divided by the total nucleus count. The cytoplasmic intensity per nucleus is the 
aggregate cytoplasm intensity divided by the total nucleus count. The average 
cytoplasm intensity is the aggregate cytoplasm intensity divided by the cytoplasm area. 
The cytoplasm nucleus ratio is the total cytoplasm area divided by the total nucleus 
area. 
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AdditionaUy, one or more fluorescent antibodies to other ceUular proteins, such 
as the major muscle proteins actin or myosin, can be included. Images of these 
additional labeled proteins can be acquired and stored with the above images, for later 
review, to identify anomaUes in the distribution and morphology of these proteins in 
hypertrophic cells. This example of a multi-parametric screen allows for simultaneous 
analysis of cellular hypertrophy and changes in actin or myosin distribution. 

One of skill in the art wiU recognize tiiat while tiie example analyzes myocyte 
hypertrophy, the methods can be applied to analyzing hypertrophy, or general 
morphological changes in any cell type. 

Cell morphology assays for prostate carcinoma 

Cell spreading is a measure of the response of cell surface receptors to substrate 
attachment ligands. Spreading is proportional to the ligand concentration or to flie 
concentration ofcompounds thai reduce receptor-ligand&nction. One example of 
selective cell-substrate attachment is prostate carcinoma cell adhesion to the 
extiacellular matrix protein collagen. Prostate carcinoma cells metastasize to bone via 
selective adhesion to coUagrau 

Compounds that interfere with metastasis of prostate carcinoma ceUs were 
screened as foUows. PC3 human prostate carcinoma cells were cultured in media with 
appropriate stimulants and are passaged to collagen coated 96 well plates. Ligand 
concentration can be varied or inhibitors of cell spreading can be added to tiie wells. 
Examples of compounds tiiat can affect spreading are receptor antagonists such as 
integrin- or proteoglycan-blockihg antibodies, signaling inhftitors including 
phosphatidyl inositol-3 kinase inhibitors, and cytoskeletal inhibitors such as 
cytochalasin D. After two hours, cells were fixed and stained with ALEXA™ 488 
phaUoidin (Molecular Probes) and Hoechst 33342 as per the protocol for ceUuIar 
hypertrophy. The size of cells under these various conditions, as measured by 
cytoplasmic staining, can be distinguished above background levels. The number of 
cells per field is determined by measuring the number of nuclei stained witii the 
Hoechst DNA dye. The area per cell is found by dividing the cytoplasmic area 
(phalloidin image) by tiie cell number (Hoechst image). The size of cells is 
proportional to the ligand-receptor function. Since the area is determined by ligand 
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concentration and by the resultant function of the ceU, drug efficacy, as well as drug 
potency, can be determined by this cell-based assay. Other measurements can be made 
as discussed above for cellular hypertrophy. 

The methods for analyzing cellular morphology can be used in a combined high 
throughput-high content screen. In one example, the high throughput mode scans the 
whole well for an increase in fluorescent phalloidin intensity. A threshold is set above 
which both nuclei (Hoechst) and cells (phaUoidin) are measured in a high content 
mode. In another example, an enviromnental biosensor (examples include, but are not 
limited to. those biosensors that are sensitive to calcium and pH changes) is added to 
the cells, and the cells are contacted with a compound. The cells are scanned in a high 
throughput mode, and those wells that exceed a pre-determined threshold for 
luminescence of the biosensor are scanned in a high content mode. 

In a further aspect, kits are provided for analyzing cellular morphology, 
comprising a luminescent compound that can be used to specifically label the cell 
cytoplasm, membrane, or cytoskeleton (such as those described above), and 
instructions for using the luminescent compound to identify test stimuli that induce or 
inhibit changes in cellular morphology according to the above methods. In a preferred 
embodiment, the kit further comprises a luminescent marker for cell nuclei. In a further 
preferred embodiment, the kit comprises at least one compound that is known to 
modify cellular morphology, including, but not limited to integrin- or proteoglycan- 
blocking antibodies, signaling inhibitors including phosphatidyl inositol-3 kinase 
inhibitors, and cytoskeletal inhibitors such as cytochalasin D. 

In another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causmg a cell 
screening system to execute the disclosed methods for analyzing ceUular morphology, 
wherein the cell screening system comprises an optical system with a stage adapted for 
holding a plate containing cells, a digital camera, a means for directing fluorescence or 
luminescence emitted fix,m the cells to the digital camera, and a computer means for 
receiving and processmg the digital data from the digital camera. 
Examples DualModeHigh Throughput and High-Content Screen 

The following example is a screen for activation of a G-protein coupled receptor 
(GPCR) as detected by the translocation of the GPCR fiom the plasma membrane to a 

48 



wo 00/508:^ 



PCT/USOO/04794 



proximal nuclear location. This example illustrates how a high throughput screen can 
be coupled with a high-content screen in the dual mode System for Cell Based 
Screening. 

G-protein coupled receptors are a large class of 7 trans-membrane domain ceU 
surface receptors. Ligands for these receptors stimulate a cascade of secondary signals 
in the cell, which may include, but are not limited to. Ca"^ transients, cyclic AMP 
production, inositol triphosphate (IP3) production and phosphorylation. Each of these 
signals are rapid, occuring in a matter of seconds to minutes, but are also generic. For 
example, many different GPCRs produce a secondary Ca^ signal when activated. 
Stimulation of a GPCR also results in the transport of that GPCR fiom tiie cell surface 
membrane to an internal, proximal nuclear compartincnt This intemaUzation is a much 
more receptor-specific indicator of activation of a particular receptor tiian are the 
secondary signals described above. 

Figure 19 illusti^es a dual mode screen for activation of a GPCR. Cells 
carrying a stable chimera of the GPCR with a blue fluorescent protein (BFP) would be 
loaded with the acetoxymethylester form of Fluo-3. a cell permeable calcium indicator 
(green fluorescence) that is trapped in Uving cells by the hydrolysis of die esters. They 
would then be deposited into the wells of a microliter plate 601- The weUs would then 
be treated with an array of test compounds using a fluid delivery system, and a short 
sequence of Fluo-3 images of the whole microtiter plate would be acquired and 
analyzed for wells exhibiting a calcium response (i.e.. high throughput mode). TTie 
images would appear like the illustration of the microtiter plate 601 in Figure 19. A 
small number of wells, such as weUs C4 and E9 in the illusti^on, would fluoresce 
more brighUy due to tiieCa^ released upon stimulation of the receptors. TTie locations 
of wells containing compounds that induced a response 602. would then be transferred 
to the HCS program and the optics switched for detailed cell by cell analysis of the blue 
fluorescence for evidence of GPCR ti:anslocation to the perinuclear region. The bottom 
of Figure 19 illustrates the two possible outcomes of tiie analysis of die high resolution 
cell data. The camera images a sub-region m of the well area m producing images 
of the fluorescent cells 6Q1. In well C4. the unifonn distribution of the fluorescence in 
the cells indicates tiiat the receptor has not internalized, implying that the Ca^ response 
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seen was the result of the stimulation of some other signalling sy^ in the ceU TT,e 
cells in weU E9 606 on the other hand, clearly indicate a concentration of the receptor 
m the permuclear region clearly indicating the lull activation of the r«:eptor. Because 
only a few hit wells have to be analyzed with high resolution, the overall thn>uglq,ut of 
the dual mode system can be quite high, comparable to the high throughput system 
alone. 



Example 4 Kinetic High Content Screen 

The following is an example of a screen to measure the kinetics of 
mtemahzation of a receptor. As described above, the stimulation of a GPCR, results in 
the mtemalization of the receptor, with a time course of about 15 min. Simply 
detectmg the endpoint as internalized or not, may not be sufficient for defining the 
potency of a compound as a GPCR agonist or antagonist. However. 3 time points at 5 
mm mtervals would provide information not only about potency during the time course 
of measurement, but would also allow extrapolation of the data to much longer time 
periods. To perform this assay, the sub-region would be defined as two rows the 
samplmg interval as 5 minutes and the total number of time points 3. TT,e system 
would then start by scamiing two rows, and then adding reagent to the two rows 
establidnng the time=0 reference. After reagent addition, the system would again scan' 
the two row sub-region acquiring the first time point data. Since this process would 
take about 250 seconds, includmg scanning back to the beginning of the sub-region, the 
system would wait 50 seconds to begin acquisition of the second time point Two more 
cycles would pHKiuce the three time points and the system would move on to the 
second 2 row sub-region. The final two 2-row sub-it^gions would be scanned to finish 
all the wells on the plate, resulting in four time points for each well over the whole 
plate. Although the time points for the wells would be offset slighUy relative to 
tmie=0. the spacing of the time points would be very close to the required 5 minutes 
and the actual acqui^tion times and results recorded with much greater precision than' 
ui a fixed-cell screoi. 
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Examples ^i^^-content screen of Human glucocorticoid re^^^^^^^ 

One class of HCS involves the drug-induced dynamic redistribution of 
intracellular constituents. TT.e human glucocorticoid receptor (hGR), a single "sensor" 
m the complex enviromnental response machinery of the ceD. binds steroid molecules 
. that have difiused into the cell. T1.e ligand-receptor complex translocates to the 
nucleus where transcriptional activation occurs (Htun et al.. Proc Natl. Acad Sci 
93:4845, 1996). 

In general, hormone receptors are exceUent drug targets because their activity 
lies at the apex of key intr^ellular signaling pathways. Therefore, a high-content 
screen of hGR translocation has distinct advantage over vitro ligand-receptor binding 
assays. n.e availability of up to two more channels of fluorescence in the cell 
screening system of the present invention permits the screen to contain two additional 
parameter, in parallel, such as other receptor., other distinct targets or other cellular 
processes. 

Plasmid construct. A eukaryotic expression plasmid containing a coding 
sequence for a green fluorescent protein - human glucocorticoid receptor (GFP-hGR) 
chm^cra was prepared using GFP mutants (Pahn et al.. Nat. Struct. Biol. 4 361 (1997) 
The construct was used to transfect a human cervical ca,t:inoma cell line (HeLa) 

CeU preparation and transfection. HeLa cells (ATCC CCL-2) were trypsinized 
and plated using DMEM containing 50/0 charcoal/dextran-treated fetal bovine ser^ 
(FBS) (HyClone) and 1% penicillin-streptomycin (C-DMEM) 12-24 hours prior to 
tx^fection and incubated at 37»C and 5% CO. . Transfections were performed by 
calcum phosphate co-precipitation (Graham and Van der Eb. Virology 52-456 1973- 
Sambrook et al.. (1989). Molecular Cloning: A Laboratory Manual, Second ed Cold 
Sprmg Harbor Laboratory Pi^. Cold Spring Harbor. 1989) or with Lipofectamine (Life 
Technologies. Gaithersburg. MD). For the calcium phosphate transfections, the 
medmm was replaced, prior to transfection, with DMEM containing 5o/„ 
charcoal/dextran-treated FBS. Cells were incubated with the calcium phosphate-DNA 

precipitate for 4-5 hours at 37T anrf so/ rn u j ^ . 

uouis at J/ u and 5/0 CO2. washed 3-4 times with DMEM to 

rernove the precipitate, followed by the addition of C-DMEM. 

Lipofectamine transfections were performed in serum-free DMEM without 
antabioucs according to the manufacturer's instructions (Life Technologies. 
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Gaithersburg. MD). Following a 2-3 hour incubation with the DNA-liposome 
complexes, the medium was removed and replaced with C-DMEM. AU transfected 
cells in 96-weU microtiter plates were incubated at 33-0 and 5% CO2 for 24-18 hours 
prior to drug treatment Experiments were performed with the receptor expressed 
transiently in HeLa cells. 

Dexamethasone induction of GFP-hGR translocation. To obtain receptor- 
ligand translocation kinetic data, nuclei of transfected cells were first labeled with 5 
jig/ml Hoechst 33342 (Molecular Probes) in C-DMEM for 20 minutes at 33'>C and 5% 
CO2. Cells were washed once in Hank's Balanced Salt Solution (HBSS) followed by 
the addition of 100 nM dexamethasone in HBSS with 1% charcoal/dextian-treated 
FBS. To obtain fixed time point dexamethasone titration data, transfected HeLa cells 
were first washed with DMEM and then incubated at 33'>C and 5% CO2 for 1 h in the 
presence of 0 - 1000 nM dexamethasone in DMEM containing \% charcoal/dextran- 
treated FBS. Cells.were analyzed live or they were rinsed with HBSS, fixed for 15 min 
with 3.7% formaldehyde in HBSS, stained with Hoechst 33342. and washed before 
analysis. The intracellular GFP-hGR fluorescence signal was not diminished by this 
fixation procedure. 

Image acquisition and analysis. Kinetic data were collected by acquiring 
fluorescence image pairs (GFP-hGR and Hoechst 33342-labeled nuclei) from fields of 
living cells at 1 min intervals for 30 min after the addition of dexamethasone. 
Likewise, image pairs were obtained fi^m each well of the fixed time point screening 
plates 1 h after the addition of dexamethasone. In both cases, the image pairs obtained 
at each time point were used to define nuclear and cytoplasmic regions in each ceU. 
Translocation of GFP-hGR was calculated by dividing the integrated fluorescence 
intensity of GFP-hGR in the nucleus by the integrated fluorescence intensity of the 
chimera in the cytoplasm or as a nuclear-cytoplasmic difference of GFP fluorescence. 
In the fixed time point screen this translocation ratio was calculated from data obtained 
fiora at least 200 cells at each concentration of dexamethasone tested. Drug-induced 
tianslocation of GFP-hGR from the cytoplasm to the nucleus was therefore correlated 
with an increase in the translocation ratio. 

Results. Figure 20 schematically displays the drug-induced cytoplasm 252 to 
nucleus 252 translocation of the human glucocorticoid receptor. The upper pair of 
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schematic diagrams depicts the localization of GFP-hGR within the cell before 25fi (A) 
and after 251 (B) stimulation with dexamethasone. Under these experimental 
conditions, the drug induces a large portion of the cytoplasmic GFP-hGR to translocate 
into the nucleus. This redistribution is quantified by determining the integrated 
intensities ratio of the cytoplasmic and nuclear fluorescence in treated 255 and 
untreated 254 cells. The lower pair of fluorescence miciogr^hs show the dynamic 
redistribution of GFP-hGR in a single cell, before 254 and after 255 treatment ITie 
HCS is performed on wells containing hundreds to thousands of transfected cells and 
the translocation is quantified for each ceU in the field exhibitmg GFP fluorescence. 
Although the use of a stably transfected cell line would yield the most consistently 
labeled cells, the heterogeneous levels of GFP-hGR expression induced by transient 
transfection did not interfere with analysis by the cell screening system of die present 
inventioa 

To execute the screen, the cell screening system scans each weU of the plate, 
images a population of cells in each, and analyzes cells individually. Here, two 
channels of fluorescence are used to define the cytoplasmic and nuclear distribution of 
the GFP-hGR within each cell. Depicted in Figure 21 is the graphical user interface of 
the cell screening system near the end of a GFP-hGR screen. The user interface depicts 
the parallel data collection and analysis capability of the system. The windows labeled 
"Nucleus" 2^1 and "GFP-hGR" 262 show the pair of fluorescence images being 
obtained and analyzed in a single field. The window labeled "Color Overlay" 260 is 
formed by pseudocoloring die above images and merging them so the user can 
immediately identify cellular changes. Within the "Stored Object Regions" window 
26S. an image containing each analyzed cell and its neighbors is presented as it is 
archived. Furthemiore, as the HCS data are being collected, they are analyzed, in this 
case for GFP-hGR translocation, and translated into an immediate "hir response. The 
96 well plate depicted in the lower window of the screen 267 shows which wells have 
met a set of user-defined screening criteria. For example, a white-colored well 262 
indicates that the drug-induced translocation has exceeded a predetermined threshold 
value of 50%. On the other hand, a black-coloied well 22Q indicates that the dmg being 
tested induced less than 10% translocation. Gray-colored wells 2M indicate "hits" 
where the translocation value fell between 10% and 50%. Row "E" on the 96 well 
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plate being analyzed 266 shows a titration with a drug known to activate GFP-hGR 
translocation, dexamethasone. This example screen used only two fluorescence 
channels. Two additional channels (Channels 3 263 and 4 264) are available for 
parallel analysis of other specific targets, cell pnx:esses. or cytotoxicity to create 
multiple parameter screens. 

There is a link between the image database and the information database that is 
a powerful tool during tiie validation process of new screens. At the completion of a 
screen, the user has total access to image and calculated data (Figure 22). The 
comprehensive data analysis package of the cell screening system allows the user to 
examine HCS data at multiple levels. Images m and detailed data in a spread sheet 
222 for individual cells can be viewed separately, or summary data can be plotted. For 
example, the calculated results of a single parameter for each cell in a 96 well plate are 
shown in the panel labeled Graph 1 225. By selecting a single point in the graph, the 
user can display the entire data set for a particular cell that is recalled fiom an existing 
database. Shown here are the image pair 22S and detailed fluorescence and 
morphometric data fiom a single ceU (Cell #118. gray line 277). The laxgo graphical 
insert 278_shows the results of dexamethasone concentration on tiae translocation of 
GFP-hGR. Each point is die average ofdata from at least 200 cells. The calculated 
ECso for dexamethasone in this assay is 2 nM. 

A powerfiil. aspect of HCS with the ceU screening system is the capability of 
kinetic measurements using multicolor fluorescence and morphometric parameters in 
living cells. Temporal and spatial measurements can be made on single cells within a 
population of cells in a field. Figure 23 shows kinetic data for the dexamethasone- 
induced translocation of GFP-hGR in several cells within a smgle field. Human HeLa 
cells transfected with GFP-hGR were treated with 100 nM dexamethasone and the 
teanslocation of GFP-hGR was measured over time in a population of single cells. The 
graph shows the response of transfected cells 285, 286. 282. and 28S and non- 
tiansfected cells 289. These data also illusti^e the ability to analyze cells with 
difTerent expression levels. 
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Example 6 High-content screen of drug-induced apoptosis 

Apoptosis is a complex ceUuIar program that involves myriad molecular events 
and pathways. To understand the mechanisms of drug action on this process, it is 
essential to measure as many of these events within cells as possible with temporal and 
spatial resolution. Therefore, an apoptosis screen that requires Uttle ceU sample 
preparation yet provides an automated readout of several apoptosis-ielated parameters 
would be ideal. A cell-based assay designed for the cell screening system has been 
used to simultaneously quantify several of the morphological, organellar, and 
macromolecular halhnailcs of paclitaxel-induced q)optosis. 

Cea preparation. The cells chosen for this study were mouse connective tissue 
fibroblasts (L.929; ATCC CCL-1) and a highly invasive gUoblastoma cell line (SNB- 
19; ATCC CRL-2219) (Welch et al.. In Vitro Cell. Dev. Biol. 31:610, 1995). The day 
before treatment with an apoptosis inducing drug, 3500 cells were placed into each well 
of a 96-well plate and incubated overnight at 37»C in a humidified 5% CO2 
atmosphere. The following day, the culture medium was removed bom each well and 
replaced with fresh medium containing various concentrations of paclitaxel (0 - 50 
HM) fiiom a 20 mM stock made in DMSO. The maximal concentration of DMSO used 
in these experiments was 0.25%. The cells were then incubated for 26 h as above. At 
the end of the paclitaxel treatment period, each weU received fresh medium containing 
750 nM MitoTracker Red (Molecular Probes;" Eugene, OR) and 3 ^g/ml Hoechst 33342 
DNA-binding dye (Molecular Probes) and was incubated as above for 20 min. Each 
well on the plate was then washed with HBSS and fixed with 3.7% formaldehyde in 
HBSS for 15 min at room temperature. The formaldehyde was washed out with HBSS 
and the cells were permeabilized for 90 s with 0.5% (v/v) Triton X-100, washed with 
HBSS, incubated with 2 U ml ' Bodipy FL phaUacidin (Molecular Probes) for 30 min, 
and washed with HBSS. The wells on the plate were then fiUed with 200 jil HBSS, 
sealed, and the plate stored at 4''C if necessary. The fluorescence signals from plates 
stored this way were stable for at least two weeks after preparation. As in the nuclear 
translocation assay, fluorescence reagents can be designed to convert this assay into a 
live cell high-content screen. 

Image acquisition and analysis on the ArrayScan System. The fluorescence 
intensity of intracellular MitoTracker Red, Hoechst 33342, and Bodipy FL phallacidin 
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was measured with the cell screening system as described s^ra. Moiphometric data 
from each pair of images obtained from each well was also obtained to detect each 
object in the image field {e.g., cells and nuclei), and to calculate its size, shape, and 
integrated intensity. 

Calculations and output A total of 50-250 cells were measure! per image 
field. For each field of cells, the following calculations were perfonned: (1) The 
average nuclear area W) was calculated by dividing the total nuclear area in a field 
by the number of nuclei detected. (2) The average nuclear perimeter « was 
calculated by dividing the sum of the perimeters of aU nuclei in a field by the number 
of nuclei detected in that field. Highly convoluted apoptotic nuclei had the largest 
nuclear perimeter values. (3) The average nuclear brightness was calculated by dividing 
the integrated intensity of the entire field of nuclei by the number of nuclei m that field. 
An increase in nuclear brightness was correlated with increased DNA content (4) The 
average cellular brightness was calculated by dividing the integrated intensity of an 
entire field of cells stained with MitoTracker dye by the number of nuclei in that field. 
Because the amount of MitoTracker dye that accumulates within the mitochondria is 
proportional to the mitochondrial potential, an increase in the average cell brightness is 
consistent with an increase in mitochondrial potential. (5) The average cellular 
brightness was also calculated by dividing the integrated intensity of an entire field of 
cells stained with Bodipy FL phallacidin dye by the number of nuclei in that field. 
Because the phallotoxins bind with high affinity to the polymerized form of actin. the 
amount of Bodipy FL phallacidin dye that accumulates within the ceU is proportional to 
actin polymerization state. An increase in the average cell brightness is consistent with 
an increase in actin polymerization. 

Results. Figure 24 (top panels) shows the changes paclitaxel induced in the 
nuclear morphology of L-929 cells. Increasing amounts of paclitaxel caused nuclei to 
enlarge and fragment 221 a halhnaik of apoptosis. Quantitative analysis of these and 
other images obtained by the cell screening system is presented m the same figure. 
Each parameter measured showed that the L-929 cells m were less sensitive to low 
concentrations of pacUtaxel than were SNB-19 cells 22Z. At higher concentrations 
though, the L-929 cells showed a response for each parameter measured. n,e 
multiparameter approach of this assay is usefiil in dissecting the mechanisms of drug 
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action. For example, the area, brightness, and fragmentation of the nucleus m and 
actin polymerization values 294 reached a maximum value when SNB-19 cells were 
treated with 10 nM paclitaxcl (Figure 24; top and bottom graphs). However, 
mitochondrial potential 295 was minimal at the same concentration of paclitaxcl 
(Figure 24; middle graph). Hie fact that aU the parameters measured approached 
control levels at increasing paclitaxcl concentrations (>10 nM) suggests that SNB-19 
cells have low afSnity drug metabolic or clearance pathways that are compensatory at 
sufficiently high levels of the drug. Contrasting the drug sensitivity of SNB-19 ceUs 
m L-929 showed a different response to paclitaxcl 2%. These fibroblastic cells 
showed a maximal response in many parameters at 5 paclitaxcl, a 500-fold higher 
dose than SNB-19 cells. Furthermore, the L-929 cells did not show a sharp decrease in 
mitochondrial potential 2S1 at any of the paclitaxel concentrations tested. This result is 
consistent with the presence of unique apoptosis pathways between a normal and 
cancer ceU line, nierefore. these results mdicate that a relatively simple fluorescence 
labeling protocol can be coupled with the cell screening system of the present invention 
to produce a high-content screen of key events involved in programmed cell death. 

Background 

A key to the mechanism of apoptosis .was the discovery that, irrespective of the 
lethal stimulus, death results in identical apoptotic morphology that includes cell and 
organelle dismantling and repackaging, DNA cleavage to nucleosome sized fragments, 
and engulfinent of the fragmented ceU to avoid an inflammatory response. Apoptosis is 
therefore distinct from necrosis, which is mediated more by acute tiamna to a cell, 
resulting in spillage of potentiaUy toxic and antigenic cellular components into the' 
intercellular milieu, leading to an inflammatory response. 

TTie criteria for determining whether a cell is undergoing apoptosis (Wyllie et 
al. 1980. IntRev Cytol. 68:251-306; Thompson, 1995. Science. 267:1456-62; Majno 
andJoris. mS.AmJPathol. 146:3-15; Allen etal. I99i. Cell Mol Life Sci, SAATJ^S) 
include distinct morphological changes in the appearance of the ceU, as weU as 
alterations in biochemical and molecular markers. For example, apoptotic cells often 
undergo cytoplasmic membrane blebbing, their chromosomes rapidly condense and 
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aggregate around the nuclear peripheiy, the nucleus fragments, and small apoptotic 
bodies arc formed. In many, but not all, apoptotic cells, chromatin becomes a target for 
specific nucleases that cleave the DNA. 

Apoptosis is commonly accompanied by a characteristic change in nuclear 
morphology (chromatin condensation or fragmentation) and a step-wise fragmentation 
of DNA culminating in the formation of mono- and/or oligomeric fragments of 200 
base pairs. Specific changes in organellar function, such as mitochondrial membrane 
potential, occur. In addition, specific cysteine proteases (caspases) are activated, which 
catalyzes a highly selective pattern of protein degradation by proteolytic cleavage after 
specific aspartic acid residues. In addition, the external surfece exposure of 
phosphatidylserine residues (normally on the inner membrane leaflet) allows for the 
recognition and elimination of apoptotic cells, before die membrane breaks up and 
cytosol or organelles spill into the intercellular space and elicit inflammatory reactions. 
Moreover, cells undergomg apoptosis tend to shrink, while also having a reduced 
intracellular potassium level. 

The general patterns of apoptotic signals are very similar among different ceU types 
and apoptotic inducers. However, the details of the pathways actually vary significantly 
depending on ceU type and inducer. The dependence and independence of various signal 
transduction pathways involved in apopto^ are currentiy topics of intense research. We 
show here that die pathway also varies depending upon the dose of the inducer in specific 
cell types. 

Nuclear Morphology 

Cells undergoing apoptosis generally exhibit two types of nuclear change, 
fragmentation or condensation ((Majno and Joris, 1995), (Eamshaw, 1995)). The 
response in a given cell type appears to vary depending on the apoptotic inducer. 
During nuclear fragmentation, a circular or oval nucleus becomes increasingly lobular. 
Eventually, the nucleus firagments dramatically into multiple sub-nuclei. Sometimes the 
density of the chromatin within the lobular nucleus may show spatial variations in 
distribution (hetercchromatization), approximating the margination seen in nuclear 
condensation. 
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Nuclear condensation has been reported in some cell types, such as MCF-7 
(Saunders et al. 1997. Int J Cancer. 70:214-20). Condensation appears to arise as a 
consequence of the loss of structural integrity of the euchromatin, nuclear matrix and 
nuclear lamina (Hendzel et al. 1998. J Biol Chem. 273:24470-8). During nuclear 
condensation, the chromatin concentrates near the maigin of the nucleus, leading to the 
overall shrinkage of the nucleus. Thus, the use of nuclear morphology as a measure of 
^optosis must take both condensation and fiagmentation into account 

Materia] and Methods 

Cells were pkted into 96-weU plates at densities of 3 x 10^ to 1 x lO" cellis/weU. 
The following day apoptotic inducers were added at indicated concentrations and cells 
were incubated for indicated time periods (usuaUy 16-30 hours). The next day medium 
was removed and cells were stained with 5 ^ig/ml Hoechst (Molecular Probes. Inc.) in 
fresh medium and incubated for 30 minutes at 37»C. Cells were washed in Hank's 
Balanced Salt Solution (HBSS) and fixed with 3.7% formaldehyde in HBSS at room 
temperature. Cells were washed 2X with HBSS at room temperature and the plate was 
sealed. 

Quantitation of changes in nuclear morphology upon induction of apoptosis was 
accomplished by (1) measuring the effective size of the nuclear region; and (2) 
measuring the degree of convolution of the perimeter. The size parameter provides the 
more sensitive measure of nuclear condensation, whereas the perimeter measure 
provides a more sensitive measure of nuclear fragmentation. 

Results & Discussion 

L929 cells responded to both staurosporine (30 hours) and paclitaxel (30 hours) 
with a dose-dq)endent change in nuclear morphology (Fig 25A and 25B). BHK celb 
illustrated a sUghfly more complicated, yet clearly visible response. Staurosporine 
appeared to stimulate nuclear condensation at lower doses and nuclear fiagmentation at 
higher doses (Fig 25C and 25D). In contrast. pacUtaxel induced a consistent increase in 
nuclear fiagmentation witfi increasing concentrations. The response of MCF-7 cells 
varied diamaticaUy depending upon tiie ^optotic inducer. Staurosporine appeared to 
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elicit nuclear condensation whereas paclitaxel induced nuclear fragmentation (Fig 25E 
and 25F). 

Kgure 26 illustrates the dose response of cells in tenns of both nuclear size and 
nuclear perimeter convolution. There appears to be a swelling of the nuclei that 
precedes the fragmentation. 

Result of evaluation: Differential responses by ceU lines and by apoptotic 
inducers were observed in a dose dependent manner, indicating that this assay will be 
useful for detecting changes in flie nucleus characteristic of ^optosis. 

Actin reorganization 

We assessed changes in the actin cytoskeleton as a potential parameter related 
to apoptotic changes. This was based on preliminary observations of an early increase 
in f-actin content detected with fluorescent phaUoidin labeling, an f-actin specific stain 
(our unpubUshed data; Levee etal. 1996. Am J Physiol. 27 UCl98U92;Maekawactal. 
1996. Clin Exp Immunol. 105:389-96). Changes in the actin cytoskeleton during 
^optosis have not been observed in all ceU types. (Endresen et al. 1995. Cytometry. 
20: 162-71, van Engeland et aL 1997. Exp Cell Res. 235:421-30). 
Material and Methods 

Cells were plated in 96-well plates at densities of 3 x lo' to 1 x 10* cells/weU. 
The following day apoptotic inducers were' added at indicated concentrations. Cells 
were incubated for the indicated time periods (usually 16-30 hours). The next day the 
medium was removed and cells were stained with 5 jig/ml Hoechst (Molecular Probes. 
Inc.) in fresh medium and incubated for 30 minutes at 30°C. Cells were washed in 
HBSS and fixed with 3.7% formaldehyde in HBSS at room temperature. Plates were 
washed with HBSS and permeabilized with 0.5% v/v Triton X-100 in HBSS at mom 
temperature. Plates were washed in HBSS and stained with 100 ^l of lU/ml of Alexa 
488 Phalloidin stock (100 fil/weU, Molecular Probes. Inc.). Cells were washed 2X with 
HBSS at RT and the plate was sealed. 

Quantitation of f-actin content was accomplished by measuring the intensity of 
phalloidin staining around the nucleus. This was determined to be a reasonable 
approximation of a fiiU cytoplasmic average of the intensity. The mask used to 
approximate this cytoplasmic measure was derived from the nuclear mask defined by 
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the Hoechst stain. Derivation was accompUshed by combinations of erosions and 
dilations. 



Results and Discussion 

Changes in f-actin content varied based on cell type and apoptotic inducer (Fig 
27). Staurosporine (30 hours) induced increases in f-actin in L929 (Fig. 27A) and BHK 
(Fig. 27B) cells. MCF-7 cells exhibited a concentration-dependent response. At low 
concentrations (Fig. 27E) there appeared to be a decrease in f-actin content At higher 
concentrations, f-actin content increased. Paclitaxel (30 houre) treatment led to a wide 
variety of response!. L929 cells responded wift graded increases in f-actin (Fig. 27B) 
whereas both BHK and MCF-7 responses were highly variable (Figs. 27D & 27F, 
respectively). 

Result of Evaluation: Both increases and decreases m signal intensity were 
measured for several cell lines and found to exhibit a concentration dependent 
response. For certain cell line/apoptotic inducer paire this could be a statistically 
significant apoptotic indicator. 

Changes in Mitochondrial Mass^otential 
Introduction 

Changes in mitochondria play a central role in apoptosis (Henkart and 
Grinstein. 1996. 7 £r/, Med. 183:1293-5). Mitochondria release ^optogenic factors 
through the outer membrane and dissipate the electrochemical gradient of the inner 
membrane. This is thought to occur via formation of the mitochondria permeability 
transition (MPT), although it is apparently not true in all cases. An obvious 
manifestation of the formation of the MPT is coUapse of the mitochondrial membrane 
potential. Inhibition of MPT by pharmacological intervention or mitochondrial 
expression of the anti-apoptotic protein Bcl-2 prevents cell death, suggesting tiie 
formation of flie MPT may be a rate-limiting event of the death process (For review 
see: Kroemer et al. 1998. Anna Rev Physiol. 60:619-42). It has also been observed that 
mitochondria can proliferate during stimulation of apoptosis (Mancini et al. 1997. J 
Cell Biol. 138:449-69; Camilleri-Broet et al. 1998. fig? Cell Res. 239:277-92). 
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One approach for measuring apoptosis-induced changes in mitochondria is to 
measur. fte mitochondrial membr^e potential. Of the methods available, the simplest 
measure is the redistribution of a cationic dye that distributes within intracellular 
organenes based on the membrane potential. Such an approach traditionaUy requires 
live cells for the measurements. The recent introduction of the MitoTracker dyes (Foot 
et al. 1997. Cytometry. 27:358-64; available fiom Molecular Probes. Inc.. Oregon) 
provides a means of measuring mitochondrial membrane potential after fixation 

Given the observations of a possible increase in mitochondrial mass during 
apoptosis. the amount of dye labeling the mitochondria is related to both membrane 
potential and the number of mitochondria. If the number of mitochondria r^ains 
constant then the amount of dye is directly related to the membr^e potential If the 
number of mitochondria is not constant, then the signal will likely be dominated by the 
increase m mass (Reipert et al. 1995. Exp Cell Res. 221:281-8). 

Probes are available that allow a clear separation between changes in mass and 
potential in HCS as^ys. Mitochondrial mass is measured directly by labeling with 
Mitotracker Green FM (Foot and Pierce. 1999. Cytometry. 35:311-7; available fiom 
Molecular Probes. Inc.. Oregon). The labeling is independent of mitochondrial 
membrane potential but proportional to mitochondrial mass. Tins also p„,vides a 
means of normalizing other mitochondrial measures in each cell with respect to 
mitochondrial mass. 



Materia] and Methods 

Cells were plated into 96-well plates at densities of 3 x 10^ to 1 x 10* cells/well 
The following day apoptotic inducers were added at the indicated concentrations and 
cells were incubated for the indicated time periods (usually 16-30 hours). CeUs were 
stamed with 5 ^gAnl Hoechst (Molecular Probes. Inc.) and 750 nM MitoTracker Red 
(CMXRos. Molecular Probes. Inc.) in fresh medium and incubated for 30 minutes at 
370C. Cells were washed in HBSS and fixed with 3.7% formaldehyde m HBSS at room 
temperature. Plates were washed with HBSS and penneabilized with O.50/0 v/v Triton 
X-IOO m HBSS at room temperature. Cells we,^ washed 2X with HBSS at room 
temperature and the plate was sealed. For dual labeling of mitochondria, cells were 
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treated with 200 nM Mitotracker Green and 200 nM Mitotracker Red for 0.5 houis 
before fixation. 



Results & Discussion 

Induction of apoptosis by staurosporine and paclitaxel led to varying 
mitochondrial changes depending upon the stimulus. L929 cells exhibited a clear 
increase in mitochondrial mass with inciting staurosporine concentrations (Fig. 28) 
BHK cells exhibited either a decrease in membrane potential at lower concentrations of 
staurosporme. or an increase in mass at higher concentrations of staurosporine (Fig 
280. MCF.7 cells responded by a consistent decrease in mitochondrial membrane 
potential m response to inciting concentrations of staurosporine (Fig 28E) 
Increasing concentrations of paclitaxel caused consistent increases in mitochondrial 
mass (Fig 28B, 28D, and 28F). 

The mitochondrial membrane potential is measured by labeling mitochondria 
with both Mitotracker Green FM and Mitotracker Red (Molecular Probes Inc) 
MitotiBcker Red labeling is proportional to both mass and membrane potential 
Mitotracker Green FM labeling is proportional to mass. ITie ratio of Mitotracker Red 
signal to the Mitotracker Green FM signal provides a measure of mitochondrial 
membrane potential (Foot and Pierce. 1999). This ratio nomializes the mitochondrial 
mass with respect to the Mitotracker Red signal. (See Figure 28G) Combining the 
ability to normalize to mitochondrial mass with a measure of the membrane potential 
allows independent assessment of both parameters. 

Result of Evaluation: Both decreases in potential and increases in mass were observed 
depending on the cell line and inducer tested. Dose dependent correlation demonstrates 
that this IS a promising ^optotic indicator. 

It is possible to combine multiple measure, of apoptosis by exploiting the 
spectral domain of fluorescence spectroscopy. In feet, all of the nuclear morphology/f- 
actm content/mitochondrial mass/mitochondrial potential data shown earUer were 
collected as multiparameter assays, but were presented individually for clarity. 
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Example 7. Protease induced translocation of a signaling enzyme containing a 
disease-associated sequencefrom cytoplasm to nucleus. 

Plasmid construct. A eukaiyotic expression plasmid containing a coding 
sequence for a green fluorescent protein - caspase (Cohen (1997). Biochemical J. 
326:1-16; Liang et al. (1997). J. o/Molec. Biol. 274:291-302) chimera is prepared using 
GFP mutants. The constmct is used to transfect eukaryotic cells. 

CeU preparation and transfection. Cells are tiypsinized and plated 24 h prior 
to transfection and incubated at 37»C and 5% CO.. Transfections are perfonned by 
methods includmg. but not limited to calcium phosphate coprecipitation or lipofection 
Cells are mcubated with the calcium phosphate-DNA precipitate for 4-5 hour, at 37»C 
and 50/0 CO,, washed 3-4 times with DMEM to remove the precipitate, followed by the 
addition of C-DMEM. Lipofectamine transfections are performed in sermn-free 
DMEM without antibiotics according to the manufacturer's instructions. Following a 
2-3 hour incubation with the DNA-liposome complexes, the medium is removed and 
nqjlaced with C-DMEM. 

Apopototic induction of Caspase-GFP translocation. To obtain Caspase-GFP 
translocation kinetic data, nuclei of transfected cells are first labeled with 5 ^g/ml 
Hoechst 33342 (Molecular Prebes) in C-DMEM for 20 minutes at 3rc and 5% CO. 
Cells are washed once in Hank's Balanced Salt Solution (HBSS) followed by the 
addition of compounds that induce apoptosis. These compounds include, but are not 
Imnted to paclitaxel. stauiosporine. ceramide. and tumor necrosis factor. To obtain 
fixed time point titration data, transfected cells are first washed with DMEM and then 
mcubated at 37»C and 50/0 CO. for 1 h in the presence of 0 - 1000 nM compound in 
DMEM. Cells are analyzed Uve or they are rinsed with HBSS. fixed for 15 min with 
3.7% formaldehyde in HBSS. stained with Hoechst 33342. and washed before analysis. 

Image acquisition and analysis. Kinetic data are coUected by acquiring 
fluorescence image pairs (Caspas^FP and Hoechst 33342-labeled nuclei) fiom fields 
of hv,ng ceUs at 1 min intervals for 30 min after the addition of compound. Likewise 
unage pairs are obtained fix>m each well of the fixed time point screening plates 1 h 
after the addition of compound, m both cases, the image pairs obtained at each time 
pomt are used to define nuclear and cytoplasmic regions in each cell. Translocation of 
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Caspase-GFP is calculated by dividing the integrated fluorescence intensity of Caspase- 
GFP in the nucleus by the integrated fluorescence intensity of the chimera in the 
cytoplasm or as a nuclear-cytoplasmic difference of GFP fluorescence. In the fixed 
time point screen this translocation ratio is calculated from data obtained fiom at least 

5 200 cells at each concentration of compound tested. Drug-induced translocation of 
Caspase-GFP fiom the cytoplasm to the nucleus is therefore correlated with an increase 
in the translocation ratio. Molecular interaction libraries including, but not limited to 
those comprising putative activators or inhibitor of apoptosis-activated enzymes are 
use to screen the indicator cell Unes and identify a specific ligand for the DAS. and a 

) pathway activated by compound activity. 

Examples. Identification ofnovel steroid receptors from DAS 

Two sources of material and/or information are required to make use of this 
embodiment, which allows assessment of the fimction of an uncharacterized gene. 
First, disease associated sequence bank(s) containing cDNA sequences suitable for 
transfection into mammalian cells can be used. Because every RADE or differential 
expression experiment generates up to several hmidred sequences, it is possible to 
generate an ample supply of DAS. Second, information fiom primary sequence 
database searches can be used to place DAS into broad categories, including, but not 
limited to, those that contain signal sequences, seven trans-membrane motife. 
conserved protease active site domains, or other identifiable motifs. Based on the 
information acquired fiom these sources, method types and indicator ceU lines to be 
transfected are selected. A large number of motifi; are ah««ly weU characterized and 
encoded in the linear sequences contained within the large number genes in existing 
genomic databases. 

In one embodiment, the following steps are taken: 

.«,rr-hJ?,c^°^°° ^'"v ^ identification experiment (including database 
searches) s used as the basis for selecting the relevant biolo^cal pixxLes (for 

SScttlt eS-f ' """^ ""^ ^'^'^ ~ 

seauenc2= 7°^^^^ °^ "^"^^ °' identifiable motife (ie. signal 

sequenc^ 7-. transmembrane domams. conserved pmtease active site domains. ^ 
This mrtial groupmg will determine fluorescent tagging strategies, host cell lines, 
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m^^^r cell lines, and banks of bioactive molecules to be screened, as described 



3) Using well estabUshed molecular biology methods, lieate DAS into ,n 
expression vector designed for this purpose. GeneSized expr;sSf ^nt^ 
promoters, enhancers, and terminators for which to deliver ta^et sc^u^l ft" S 
for trans.en repression. Such vectors may also contain antitody togginrs«,ueL« 
du^t association sequences, chromophore fusion sequences like GFpf S t^ feS 
detection when expressed by the host , cic. lo racuitate 

4) Transiently transfect cells with DAS containing vectors using standard 

S TdeS^^!^"'"^^;^^^^ iprecipita^^riST^e 
^ ? . ^ mediated, polycationic mediated, viral mediated, or 

Reoperation, and plate into microtiter plates or microwel?arTays. S^vely 
transfecbon can be done direcUy in the microtiter plate itself. '^lemanveiy, 

5) Cany out the cell screening methods as described supra. 

In this embodiment. DAS shown to possess a motif(s) suggestive of 
transcriptional activation potential (for example. DNA binding domain, amino terminal 
modulating domain, hinge region, or carboxy terminal ligand binding domain) arc 
utilized to identify novel steroid receptors. 

Defining the fluorescent tags for this experiment involves identification of the 
nucleus through staining, and tagging the DAS by creating a GFP chimera via insertion 
of DAS into an expression vector, proximally fiised to the gene encoding GFP. 
Alternatively, a single chain antibody fiagment with high affinity to some portion of the 
expressed DAS could be constructed using technology available in the art (Cambridge 
Antibody Technologies) and linked to a fluorophore (FITQ to tag the putative 
transcriptional activator/receptor in the cells. This alternative would provide an 
external tag requiring no DNA transfection and therefore would be useful if distribution 
data were to be gathered from the origmal primary cultures used to generate the DAS. 

Plasmid construct. A eukaiyotic expression plasmid containing a coding 
sequence for a green fluorescent protein - DAS chimera is prepared using GFP 
mutants. The construct is used to transfect HeLa cells. The plasmid, when transfected 
into the host cell, produces a GFP fiised to the DAS protein product, designated GFP- 
DASpp. 
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CeU preparation and transfection. HeLa cells are tiypsinized and plated using 
DMEM containing 5% charcoal/dextran-treated fetal bovine serum (FBS) (Hyclone) 
and 1% penicillin-streptomycin (C-DMEM) 12-24 hours prior to transfection and 
incubated at ZTC and 5% CO2 . Transfections are performed by calcium phosphate 
coprecipitation or with Lipofectamine (Life Technologies). For the calcium phosphate 
transfections, the medium is replaced, prior to transfection, with DMEM containing 5% 
charcoal/dextran-treated FBS. Cells are incubated with the calcium phosphate-DNA 
precipitate for 4-5 hours at 37*C and 5% CO2, and washed 3-4 times with DMEM to 
remove the precipitate, followed by the addition of C-DMEM, Lipofectamine 
transfections are performed in serum-free DMEM without antibiotics according to the 
manufacturer's instructions. Following a 2-3 hour incubation with the DNA-liposome 
complexes, the medium is removed and replaced with C-DMEM, All transfected cells 
in 96-well microtiter plates are incubated at 33°C and 5% CO2 for 24-48 hours prior to 
drug treatment Experiments are performed with the receptor expressed transiently in 
HeLa cells. 

Localization of expressed GFP~DASpp inside cells. To obtain cellular 
distribution data, nuclei of transfected cells are first labeled with 5 ^g/ml Hoechst 
33342 (Molecular Probes) in C-DMEM for 20 minutes at 33°C and 5% CO2. Cells are 
washed once in Hank*s Balanced Salt Solution (HBSS). The cells are analyzed live or 
they are rinsed with BBSS, fixed for 15 min with 3.7% formaldehyde in HBSS, stained 
with Hoechst 33342, and washed before analysis. 

In a preferred embodiment, image acquisition and analysis are performed using 
the cell screening system of the present invention. The intracellular GFP-DASpp 
fluorescence signal is collected by acquiring fluorescence image pairs (GFP-DASpp 
and Hoechst 33342-labeled nuclei) fit)m field cells. The image pairs obtained at each 
time point are used to define nuclear and cytoplasmic regions in each cell. Data 
demonstrating dispersed signal in the cytoplasm would be consistent with known 
steroid receptors that are DNA transcriptional activators. 

Screening for induction of GFP-DASpp translocation. Using the above 
construct, confirmed for ^propriate expression of the GFP-DASpp, as an indicator cell 
line, a screen of various ligands is performed using a series of steroid type ligands 
including, but not limited to: estrogen, progesterone, retinoids, growth factors, 
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androgens, and many other steroid and steroid based molecules. Image acquisition and 
analysis are performed using the cell screening system of the invention. The 
intracellular GFP-DASpp fluorescence signal is collected by acquiring fluorescence 
image pairs (GFP-DASpp and Hoechst 33342-labeled nuclei) from fields cells. The 

s image pairs obtained at each time point are used to define nuclear and cytoplasmic 
regions in each cell. Translocation of GFP-DASpp is calculated by dividing the 
integrated fluorescence intensity of GFP-DASpp in the nucleus by the integrated 
fluorescOTce intensity of the chimera in the cytoplasm or as a nuclear-cytoplasmic 
difference of GFP fluorescence. A translocation fi^om the cytoplasm into the nucleus 

10 indicates a ligand binding activation of the DASpp thus identifying the potential 
receptor class and action. Combining this data with other data obtained in a similar 
fashion using known inhibitors and modifiers of steroid receptors, would either validate 
the DASpp as a target, or more data would be generated from various sources. 



15 Example 9 Additional Screens 

Translocation between the plasma membrane and the cytoplasm: 

Profilactin complex dissociation and binding of profilin to the plasma 
membrane. In one embodiment, a fluorescent protein biosensor of profilin membrane 
binding is prepared by labeling purified profilin (Federov et al.(1994), J] Molec, Biol. 

20 241:480-482; Lanbrechts et al. (1995), Euk J. Biochem. 230:281-286) with a probe 
possessing a fluorescence lifetime in the range of 2-300 ns. The labeled profilin is 
introduced into living indicator cells using bulk loading methodology and the indicator 
cells are treated with test compounds. Fluorescence anisotropy imaging microscopy 
(Gough and Taylor (1993), J, Cell BioL 121:1095-1107) is used to measure test- 

25 compound dependent movement of the fluorescent derivative of profilin between the 
cytoplasm and membrane for a period of time after treatment ranging from 0.1 s to 10 
h. 

Rho-RhoGDI complex translocation to the membrane. In another 

embodiment, indicator cells are treated with test compounds and then fixed, washed, 

30 and penmeabilized. The indicator cell plasma membrane, cytoplasm, and nucleus are 

all labeled with distinctly colored maricers followed by unmunolocalization of Rho 

protein (Self et al. (1995), Methods in Enzymology 256:3-10; Tanaka et al. (1995), 
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Methods in En^ymology 256:4M9) with antibodies labeled with a fourth color Each 
of the four labels is imaged s^arately using the cell screening system, and the images 
used to calculate the amount of inhibition or activation of translocation effected by the 
test compound. To do this calculation, the images of the probes used to marie the 
plasma membrane and cytoplasm are used to mask the image of the immunological 
probe marking the location of intracellular Rho pn,tein. TT,e integrated brightness per 
unit area under each mask is used to form a tnmslocation quotient by dividing the 
plasma membrane integnUed brightness/ar^ by the cytoplasmic integrated 
bnghtness/area. By comparing the translocation quotient values from control and 
experimental wells, the percent translocation is calculated for each potential lead 
compound. 



^-Arrestin translocation to the plasma membrane upon G-protein receptor activation. 

In another embodiment of a cytoplasm to membrane translocation high-content 
screen, the translocation of P-arrestin protein fiom the cytoplasm to the plasma 
membrane is measured in response to cell treatment To measure the translocation, 
livmg indicator cells containing luminescent domain markers are treated with test 
compounds and the movement of the p-arrestin marker is measured in time and space 
usmg the cell screening system of the present invention. In a preferr«l embodiment, 
the mdicator cells contain luminescent markers consisting of a green fluorescent protein 
P-arrestin (GFP-P-an-estin) protein chimera (Barak et al. (1997). J. Biol. Chem 
272:27497-27500; Daaka et al. (1998). J. Biol. Chem. 273:685-688) that is expressed 
by the indicator cells through the use of tr^ient or stable cell transfection and other 
reporters used to mark cytoplasmic and membi^e domains. When &e indicator ceUs 
are m the resting state, the domain maricer molecules partition predominately in the 
plasma membrane or in the cytoplasm. In the high-content sci^ these maricers are 
used to delineate the cell cytoplasm and plasma membrane in distinct chamiels of 
fluorescence. When the indicator ceUs are treated with a test compound, the dynamic 
redistribution of the GFP.p-arrestin is recorded as a series of images over a time scale 
rangmg fiom 0.1 s to 10 h. In a preferred embodiment, the time scale is 1 h. Each 
unage is analyzed by a method that quantifies the movement of the GFP-p-arrestin 
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protein chimera between the plasma membrane and the cytoplasm. To do this 
calculation, the images of the probes used to marie the plasma membrane and cytoplasm 
are used to mask the image of the GFP-P-arrestin probe marking the location of 
intracellular GFP-p-anestin protein. Tie integrated brightness per unit area under each 
mask is used to form a translocation quotient by dividing the plasma membrane 
integrated brightness/area by the cytoplasmic integrated brightness/area. By comparing 
the translocation quotient values from control and experimental wells, the percent 
translocation is calculated for each potential lead compound. The output of the high- 
content screen relates quantitative data describing the magnitude of the translocation 
within a large number of individual cells that have been treated with test compounds of 
interest. 

Translocation between the endoplasmic reticulum and the Golgi: 

In one embodiment of an endoplasmic reticulum to Golgi translocation high- 
content screen, the translocation of a VSVG protein from the ts045 mutant strain of 
vesicular stomatitis virus (Ellenberg et al. (1997), J. Cell Biol 138:1193-1206; Presley 
et al. (1997) Nature 389:81-85) from the endoplasmic reticulum to the Golgi domain is 
measured in response to cell treatment To measure the translocation, indicator cells 
containing luminescent reporters are treated with test compounds and the movement of 
the reporters is measured in space and time using the cell screening jq^tem of the 
present invention. The indicator cells contain luminescent reporters consisting of a 
GFP-VSVG protein chimera that is expressed by the indicator ceU through the use of 
transient or stable cell transfection and other domam markers used to measure the 
localization of the endoplasmic reticulum and Golgi domains. When the indicator cells 
are in their resting state at 40'>C, the GFP-VSVG protein chimera molecules are 
partitioned predominately in the endoplasmic reticulum. In this high-content screen, 
domain maricers of distinct colors used to delineate the endoplasmic reticulum and the 
Golgi domains in distinct channels of fluorescence. When the indicator cells are treated 
with a test compound and the temperature is simultaneously lowaed to 32°C, the 
dynamic redistribution of the GFP-VSVG protein chimera is recorded as a series of 
images over a time scale ranging from 0.1 s to 10 h. Each image is analyzed by a 
method that quantifies the movement of the GFP-VSVG protein chimera between the 
endoplasmic reticulum and the Golgi domains. To do this calculation, the images of 
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the probes used to mark the endoplasnuc reticulum and the Golgi domains are used to 
mask the image of the GFP-VSVG probe marking the location of intraceUular GFP- 
VSVG protein. The integrated brightness per unit area under each mask is used to form 
a translocation quotient by dividing the endoplasmic reticulum integrated 
bnghtness/area by the Golgi integrated brightness/ar^a. By comparing the translocation 
quotient values from control and experimental wells, the percent translocation is 
calculated for each potential lead compomid. The output of the high-content screen 
relates quantitative data describing the magnitude of the translocation within a large 
number of individual cells that have been treated with test compounds of interest at 
fmal concentrations ranging fiom lO"'^ M to lO'^ M for a period ranging from 1 min to 

1 A U 



10 h. 



Induction and inhibition oforganellar Junction: 
Intracellular microtubule stability. 

In another aspect of the invention, an automated method for identifying 
compounds that modify microtubule structure is provided. In this embodiment, 
indicator cells are treated with test compounds and the distribution of luminescent 
microtubule-labeling molecules is measured in space and time using a cell screening 
system, such as the one disclosed above. The luminescent microtubule-labeling 
molecules may be expressed by or added to the cells either before, together with, or 
after contacting die cells with a test compound. 

In one embodiment of this aspect of the invention, living cells express a 
luminescently labeled protein biosensor of microtubule dynamics, comprising a protein 
that labels microtubules fused to a luminescent protein. Appropriate microtubule- 
labeling proteins for this aspect of the invention include, but are not limited to o and p 
tubulin isofonns, and MAP4. Preferred embodiments of the luminescent protein 
include, but are not limited to green fluorescent protein (GFP) and GFP mutants. In a 
preferred embodiment, the method involves transfecting cells with a microtubule 
labeling luminescent protein, wherein the microtubule labeling protein can be, but is 
not lunited to, a-tubulin, p-tubulin, or microtubule^associated protein 4 (MAP4). The 
approach outlined here enables those skilled in the art to make live cell measurements 
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to detennine the effect of lead compounds on tubulin activity and nucxotubule stability 



in vivo. 



In a most preferred embodiment. MAP4 is luscd to a modified vemon of the 
Aeguorea victoria green fluorescent protein (GFP). A DNA construct has been made 
which consists of a fusion between the EGFP coding sequence (available fiom 
Clontech) and the coding sequence for mouse MAP4. (Olson et al. (1995) J CeU 
Biol. 130(3): 639-650). MAP4 is a ubiquitous microtubule-associated protein that is 
known to interact with microtubules in interphase as well as mitotic cells (Ohnsted and 
Murofushi, (1993). MAP4. In "Guidebook to the Cytoskeleton and Motor Proteins " 
OxfordUniversityPress. T. Kreis and R. Vale, eds.) Its localization, then, can serve as 
an indicator of the localization, organization, and integrity of microtubules in living (or 
fixed) ceUs at all stages of the cell cycle for cell-based HCS assays. While MAP2 and 
tau (microtubule associated proteins expressed specifically in neuronal cells) have been 
used to form GFP chimeras (Kaech et al, (1996) Neuron. 17: 1189-1199; Hall et aL 
(1997). Proc. Nat Acad. Sci. 94: 4733-4738) their restricted ceU type distf bution and 
the tendency of these proteins to bundle microtubules when overexpressed make these 
proteins less desirable as molecular reagents for analysis in live celk originating from 
varied tissues and organs. Moderate overexpression of GFP-MAP4 does not disrupt 
microtubule fimction or integrity (Olson et al., 1995). Smiilar constructs can be made 
using P-tubuUn or a-tubulin via standard techniques in the art. These chimeras will 
provide a means to observe and analyze microtobule activity in Uving cells during all 
stages of the cell cycle. 

In another embodiment, the luminescently labeled protein biosensor of 
microtubule dynamics is expressed, isolated, and added to the cells to be analyzed via 
bulk loading techniques, such as microinjection, scrape loading, and impact-mediated 
loading. In this embodiment, there is not an issue of overexpression within the cell, 
and thus a and p tubulin isofoims. MAP4. MAP2 and/or tau can all be used. 

In a further embodiment, the protein biosensor is expressed by the cell, and the 
cell is subsequently contacted with a luminescent label, such as a labeled antibody, that 
detects the protein biosensor, endogenous levels of a protein antigen, or both. In this 
embodiment, a luminescent label that detects a and p tubulin isoforms, MAP4. MAP2 
and/or tau. can be used. 
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A variety of GFP mutants are available, all of which would be effective in this 
invention, including, but not limited to, GFP mutants which are commercially available 
(Clontech, California). 

The MAP4 construct has been introduced into several mammalian ceU lines 
5 (BHK-21. Swiss 3T3. HeLa, HEK 293. LLCPK) and the organization and localization 
of tubulm has been visualized in live cells by virtue of the GFP fluorescence as an 
indicator of MAP4 localization. The construct can be expressed transiently or stable 
cell Imes can be prepared by standard methods. Stable HeLa cell lines expressing the 
EGFP-MAP4 chimera have been obtained, indicating that expression of the chimera is 
10 not toxic and does not interfere with mitosis. 

Possible selectable markers for establishment and maintenance of stable cell 
Imes include, but are not limited to the neomycin resistance gene, hygromycin 
resistance gene, zeocin resistance gene, puromycin resistance gene, bleomycin 
resistance gene, and blastacidin resistance gene. 

The utility of this method for the monitoring of microtubule assembly, 
disassembly, and reamngement has been demonstrated by treatment of transiently and 
stably transfected cells with microtubule drugs such as paclitaxel, nocodazole, 
vincristine, or vinblastine. 

The present method provides high-content and combined high throughput-high 
content cell-based screens for anti-microtubule drugs, particularly as one parameter in a 
multi-parametric cancer target screen. The EGFP-MAP4 construct used herein can also 
be used as one of the components of a high-content screen that measures multiple 
signaling pathways or physiological events. In a preferred embodiment, a combined 
high throughput and high content screen is employed, wherein multiple cells in each of 
the locations containing cells are analyzed in a high throughput mode, and only a subset 
of the locations containing cells are analyzed in a high content mode. The high 
throughput screen can be any screen that would be useful to identify those locations 
containing ceUs that should be further analyzed, including, but not limited to, 
identifying locations with increased luminescence intensity, those exhibiting 
expression of a reporter gene, those underioing calcium changes, and those 
undergoing pH changes. . 
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m addition to drug screening applications, the present invention may be applied 
to chmcal diagnostics, the detection of chemical and biological warfare weapons and 
the basic research market since fundamental ceU processes, such as cell division and 
motihty, are highly dependent upon microtubule dynamics. 

Image Acquisition andAnofysis 

Image data can be obtained ftom either fixed or living indicator cells. To 
extract morphometiic data fiom each of the images obtained tiie following method of 

analysis is used; 

1. TTireshold each nucleus and cytoplasmic image to produce a mask that has value = 
0 for each pixel outside a nucleus or cell boundary. 

2. Overlay the mask on the original image, detect each object in the field (i.e., nucleus 
or cell), and calculate its size, shape, and integrated intensity. 

3. Overlay die whole cell mask obtained above on the corresponding luminescent 
microtubule image and apply one or more of the following set of classifiers to 
determine tiie micrtotubule morphology and the effect of drugs on microtiAule 
morphology. 

Microtubule morphology is defined using a set of classifiers to quantify aspects 

of microtubule shape, size, aggregation state, and polymerization state. These 

classifiers can be based on approaches that include co-occurrence matrices, texture 

measurements, spectral methods, stiiictural methods, wavelet transfonns, statistical 

metiiods, or combinations thereof Examples of such classifiers are as follows: 

A. ^\ classifier to quantify microhibule length and width using edge 

detection methods such as tiiat discussed in Kolega et al. ((1993). Biolmaging Irist 
150) which discloses a non-automated method to determine edge stiength in individual 
cells), to calculate the total edge strength within each ceU. To normalize for cell size, 
tfie total edge strength can be divided by the ceil area to give a •'microhxbuS 
morphology value. Large microtixbule morphology values are associated with strong 
edge strength values and are therefore maximal in cells containing distinct microtixbule 
sfructiir^ Likewise, small microtubule morphology values are associated with weak 
edge stiength and are minimal in cells with depolymerized microtiibules The 
physiological range of microhibule moiphology values is set by treating cells with 
eitiier the microtiibiUe stabilizing dmg pacUtaxel (10 nM) or the microtiibule 
aepoiymenzmg drug nocodazole (10 [ig/ml). 

2. A classifier to quantify microhibule aggregation into punctate spots or 
loci usmg methodology firom the receptor internalization methods discussed simii. 
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3. A Classifier to quantify microtubule depolymerization using a measure 
of unage texture. * "^*^uic 

u i\ ^ classifier to quantify apparent interconnectivify, or branching (or 
bofli), of the microtubules. ^ 

5. Measurement of the kinetics of microtubule reorganization using the 
above classifiers on a time series of images of cells treated with test compounds. 

In a fiirther aspect, kits are provided for analyzing microtubule stability, 
comprising an expression vector comprising a nucleic acid that encodes a microtubule 
labeling protein and instructions for using the expression vector for carrying out the 
methods described above. In a preferred embodiment, the expression vector further 
comprises a nucleic acid that encodes a luminescent protein, wherein the microtubule 
binding protein and the luminescent protein thereof are expressed as a fiision protein. 
Alternatively, the kit may contain an antibody that specifically binds to the 
microtubule-labeling protein. In a fiirther embodiment, the kit includes cells that 
express the microtubule labeling protein. In a preferred embodiment, the cells are 
transfected with the expression vector. In another prefeired embodiment, the kits 
fiirther contain a compound that is known to disrupt microtubule structure, including 
but not limited to curacin, nocodazole, vincristine, or vinblastine. In another preferred 
embodiment, the kits fiirther comprise a' compound that is known to stabilize 
microtubule structure, including but not limited to taxol (paclitaxel), and 
discodermolide. 

hi another aspect, the present invention comprises a machine readable storage 
medium comprising a program containing a set of instructions for causing a cell 
screening system to execute the disclosed methods for analyzing microtubule stabiUty, 
wherein the ceU screening system comprises an optical system with a stage adapted for 
holding a plate containing cells, a digital camera, a means for directing fluorescence or 
luminescence emitted fiom the cells to the digital camera, and a computer means for 
receiving and processing the digital data fiom the digital camera. 
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High-content screens involving the Junctional localization ofmacromolecules 

Within this class of high-content screen, the functional localization of 
macromolecules in response to external stimuli is measured within living cells. 

Glycolytic enzyme activity regulation. In a preferred embodiment of a 
cellular enzyme activity high-content screen, the activity of key glycolytic regulatory 
enzymes are measured in treated cells. To measure enzyme activity, indicator cells 
containing luminescent labeling reagents are treated with test compounds and the 
activity of the reporters is measured in space and time using ceU screening system of 
the present invention. 

In one embodiment, the reporter of intracellular enzyme activity is fructose-6- 
phosphate, 2-kinase/fructose-2,6-bisphosphatase (PFK-2). a regulatory enzyme whose 
phosphorylation state indicates intracellular carbohydrate anaboUsm or catabolism 
(Deprez et al. (1997) J. Biol. Chem. 212.11269-1121 S; Kealer et al. (1996) FEBS 
Letters 395:225-227; Lee et al. (1996). Biochemistry 35:6010-6019). The mdicator 
cells contain luminescent reporters consisting of a fluorescent protein biosensor of 
PFK-2 phosphorylation. The fluorescent protein biosensor is constructed by 
introducing an environmentally sensitive fluorescent dye near to the known 
phosphorylation site of the enzyme (Deprez et al. (1997), supra\ Giuliano et al. (1995), 
supra). The dye can be of the ketocyanine class (Kessler and Wolfbeis (1991), 
Spectrochimica Acta 47A:187-192 ) or any class that contains a protein reactive moiety 
and a fluorochrome whose excitation or emission spectrum is sensitive to solution 
polarity. The fluorescent protein biosensor is introduced into the indicator cells using 
bulk loading methodology. 

Living indicator cells are treated with test compounds, at final concentrations 
ranging from lO '^ M to lO'^ M for times ranging from 0.1 s to 10 h. In a preferred 
embodiment, ratio image data are obtained from living treated indicator cells by 
collecting a spectral pair of fluorescence images at each time point To extract 
morphometric data from each time point, a ratio is made between each pair of images 
by numerically dividing the two spectral images at each time point, pixel by pixel 
Each pixel value is then used to calculate the fractional phosphorylation of PFK-2. At 
smaU fractional values of phosphorylation, PFK-2 stimulates carbohydrate catabolism. 
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At high fractional values of phosphorylation, PFK-2 stimulates carbohydrate 
anabolism. 

Protein kinase A activity and localization of subunits. In anotlier 
embodiment of a high-content screen, both the domain localization and activity of 
piotein kinase A (PKA) within indicator cells are measured in response to treatment 
with test compounds. 

The indicator cells contain luminescent reporters including a fluorescent protein 
biosensor of PKA activation. The fluorescent protein biosensor is constructed by 
mtroducing an enviromnentally sensitive fluorescent dye into the catalytic subimit of 
PKA near the site known to interact with the regulatory subunit of PKA (Harootunian 
et al. (1993), Mol. Biol, of the Cell 4:993-1002; Johnson et al. (1996). Cell 85:149-158- 
Giuliano et al. (1995). supra). The dye can be of the ketocyanine class (Kessler and 
Wolfl,eis (1991). Spectrochimica Acta 47A:187-192) or any class that contains a 
protein reactive moiety and a fluorochrome whose excitation or emission spectrum is 
sensitive to solution polarity. The fluorescent protein biosensor of PKA activation is 
introduced into the indicator cells using bulk loading methodology. 

In one embodiment. Uving indicator cells are treated with test compounds, at 
final concentrations ranging from lO '^ M to lO"' M for times ranging from 0.1 s to 10 
h. In a preferred embodiment, ratio image data are obtained from living treated 
indicator cells. To extract biosensor data from each time point, a ratio is made between 
each pair of images, and each pixel value is then used to calculate the fractional 
activation of PKA {e.g.. separation of the catalytic and regulatory subunits after cAMP 
binding). At high fractional values of activity. PFK-2 stimulates biochemical cascades 
within the living cell. 

To measure fee translocation of the catalytic subunit of PKA. indicator celk 
containing luminescent reporters are treated with test compounds and the movement of 
the reporters is measured in space and time using the cell screening system. TTie 
indicator cells contain luminescent reporters consisting of domain markers used to 
measure the localization of the cytoplasmic and nuclear domains. When the indicator 
cells are treated with a test compounds, the dynamic redistribution of a PKA 
fluorescent protein biosensor is recorded inHacellularly as a series of images over a 
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tune scale ranging fiom 0.1 s to 10 h. Each image is analyzed by a method that 
quantifies the movement of the PKA between the cytoplasmic and nuclear domains To 
do this calculation, the images of the probes used to marie the cytoplasmic and nuclear 
domains are used to mask the image of the PKA fluorescent protein biosensor TT,e 
integrated brightness per unit area under each mask is used to form a tr^location 
quotient by dividing the cytoplasmic integrated brightness/area by the nuclear 
integrated brightness/ar«a. By comparing the translocation quotient values fiom 
control and experimental wells, the percent translocation is calculated for each potential 
lead compound. The output of the high-content screen relates quantitative data 
descnbing the magnitude of the ^location within a large number of individual cells 
that have been treated with test compound in the concentr^on range of lO '^ M to 10'^ 
M. 



High^ontent screens involving the induction or inhibition of gene expression 

RNA-based fluorescent biosensors 

Cytoskeletal protein transcription and message localization. Regulation of 
the general classes of cell physiological responses including ceU-substrate adhesion, 
cell-cell adhesion, signal transduction, cell-cycle events, intermediary and signaling 
molecule metabolism, cell locomotion, cell-cell communication, and cell death can 
mvolve the alteration of gene expression. High-content screens can also be designed to 
measure this class of physiological response. 

m one embodiment, the reporter of intraceUular gene expression is an 
oligonucleotide that can hybridize with the target mRNA and alter its fluorescence 
signal. In a preferred embodiment, the oligonucleotide is a molecular beacon (Tyagi 
and Kramer (1996) Nat. BiotechnoL 14:303-308). a luminescence-based reagent whose 
fluorescence signal is dependent on intermolecular and intramolecular interactions 
The fluorescent biosensor is constructed by introducing a fluorescence energy transfer 
pair of fluorescent dyes such that there is one at each end (5' and 3') of the reagent 

T^e dyes can beof any class that containsaprotein reactive moiety and fluorochromes 
whose excitation and emission spectra overiap sufficienUy to provide fluorescence 
^lergy transfer between the dyes in the resting state, including, but not limited to 
fluorescein and rhodamine (Molecular P^bes. Inc.). In a preferred embodiment, a 
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portion of the mwsage coding for P-actin (Kislauskis et al. (1994). J. Cell Biol. 
127:441-451; McCann et al. (1997), Proc. Natl Acad ScL 94:5679-5684; Sutoh 
(1982). Biochemistry 213654-3661) is inserted into the loop region of a hairpin-shaped 
oligonucleotide with the ends tethered together due to intramolecular hybridization. At 
each end of the biosensor a fluorescence donor (fluorescein) and a fluorescence 
acceptor (ihodamine) are covalenUy bound. In the tethered state, the fluorescence 
energy transfer is maximal and therefore indicative of an unhybridized molecule. 
When hybridized with the mRNA coding for |3-actin. the tether is broken and energy 
transfer is lost The complete fluorescent biosensor is introduced into the indicator 
cells using bulk loading methodology. 

In one embodiment, living indicator cells are treated with test compounds, at 
fmal concentrations ranging fiom lO '^ M to 10"^ M for times ranging fiom 0.1 s to 10 
h. In a preferred embodiment, ratio image data are obtained from living treated 
indicator cells. To extract morphometric data from each time point, a ratio is made 
between each pair of images, and each pixel value is then used to calculate the 
fractional hybridization of the labeled nucleotide. At small fractional values of 
hybridization little expression of p-actin is indicated At high fractional values of 
hybridization, maximal expression of P-actin is indicated. Furthermore, the distribution 
of hybridized molecules within the cytoplasm of the indicator cells is also a measure of 
the physiological response of the indicator cells. 

Cell surface binding of a ligand 

Labeled insulin binding to its ceU surface receptor in living ceUs. Cells 
whose plasma membrane domain has been labeled with a labeling reagent of a 
particular color are incubated with a solution containing insulin molecules (Lee et al. 
(1997). Biochemistry 36:2701-2708; Martinez-Zaguilan et al. (1996), Am. J. Physiol 
270:C1438-C1446) that are labeled with a luminescent probe of a different color for an 
appropriate time under the appropriate conditions. After incubation, unbound insuUn 
molecules are washed away, the ceUs fixed and the distribution and concentration of the 
insulin on the plasma membrane is measured. To do this, the cell membrane image is 
used as a mask for the insulin image. The integrated intensity, from the masked insulin 
image is compared to a set of images containing known amounts of labeled insulin. 
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The amount of insulin bound to the cell is detennined fiom the standards and used in 
conjunction with the total concentration of insulin incubated with the ceU to calculate a 
dissociation constant or insulm to its cell surface receptor. 

Labeling of cellular compartments 
Whole cell labeling 

Whole cell labeling is accomplished by labeling cellular components such that 
dynamics of cell shape and motiUty of the cell can be measured over time by analyzing 

fluorescence images of cells. 

In one embodiment, small reactive fluorescent molecules are introduced into 
Imng cells. These membrane-penneant molecules both diffiise through and react with 
protein components in the plasma membrane. Dye molecules react with intracellular 
molecules to both increase the fluorescence signal emitted fi^m each molecule and to 
entrap the fluorescent dye within living cells. These molecules include reactive 
chloromethyl derivatives of aminocoumarins, hydrexycoumarins. eosin diacetate 
fluorescein diacetate, some Bodipy dye derivatives, and tetramethyhhodamine TTie 
reactivity of these dyes toward macromolecules includes free primary amino groups 
and free sulfhydryl groups. 

In another embodiment, the cell surface is labeled by allowing the cell to 
interact with fluorescently labeled antibodies or lectins (Sigma Chemical Company, St. 
Louis. MO) that react specifically with molecules on the cell surface. CeU surface 
protein chimeras expressed by the cell of interest that contain a green fluorescent 
protem, or mutant thereof, component can also be used to fluorescenUy label the entire 
cell surfece. Once the entire ceU is labeled, images of the entire cell or cell array can 
become a parameter in high content screens, involving the measurement of cell shape, 
motility, size, and growth and division. 



Plasma membrane labeling 

In one embodiment, labeling the whole plasma membrane employs some of the 
same methodology described above for labeling the entire cells. Luminescent 
molecules that label the entire ceU surface act to delineate the plasma membrane 



80 



wo 00/50872 



PCT/USOO/04794 



In a second embodiment subdomains of the plasma membrane, the extraceUuIar 
surfece. the lipid bilayer. and the intracellular surface can be labeled separately and 
used as components of high content screens. In the first embodiment, the extracellular 
surface is labeled using a brief treatment with a reactive fluorescent molecule such as 
the succinimidyl ester or iodoacetamde derivatives of fluorescent dyes such as the 
fluoresceins, rhodamines, cyanines, and Bodipys. 

In a third embodiment, the extraceUular surface is labeled using fluorescenUy 
labeled macromolecules with a high affinity for cell surface molecules. TTiese include 
fluorescenUy labeled lectins such as the fluorescein, rhodamine, and cyanine 
denvatives of lectins derived fiom jack bean (Con A), red kidney bean 
(erythroagglutinin PHA-E), or wheat germ. 

In a fourth embodiment, fluorescently labeled antibodies with a high affinity for 
cell surface components are used to label the extracellular region of the plasma 
membrane. ExtraceUular regions of ceU surface receptors and ion chamiels are 
examples of proteins Uiat can be labeled wiUi antibodies. 

In a fifth embodiment, the lipid bilayer of the plasma membrane is labeled with 
fluorescent molecules. These molecules include fluorescent dyes attached to long chain 
hydrophobic molecules Uiat interact strongly with Ae hydrophobic region in the center 
of fte plasma membrane lipid bilayer. Examples of Uiese dyes include Uie PKH series 
of dyes (U.S. 4,783.401, 4.762701. and 4.859,584; available commereially from Sigma 
Chemical Company. St Loius. MO), fluorescent phospholipids such as 
mtrobenzoxadiazole glycerophosphoeUianolamine and fluorescein-derivatized 
dihexadecanoylglycerophosphoefta-nolamine, fluorescent fatty acids such as 5-butyl- 
4,4-difluoro-4-bora-3a,4a-diaza-s-indacene-3-nonanoic acid and 1-pyrenedecanoic acid 
(Molecular Probes. Inc.), fluorescent sterols includmg cholesteiyl 4,4-difluoro-5 7- 
dimethyl-4-bora-3a.4a-dia2a-s-indacene-3-dodecanoate and cholesteryl 'l- 
pyrenehexanoate. and fluorescenUy labeled proteins that interact specificaUy witii Upid 
bilayer components such as Uie fluorescein derivative of amiexin V (Caltag Antibody 
Co. Buriingame, CA). 

In anoUier embodiment, the intracellular component of Uie ph.sma membrane is 
labeled wiUi fluorescent molecules. Examples of fliese molecules are the intracellular 
components of tiie trimeric G-protein receptor, adenylyl cyclase, and ionic transport 
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proteins. These molecules can be labeled as a result of tight binding to a fluorescenUy 
labeled specific antibody or by the incorporation of a fluorescent protein chimera that is 
comprised of a membrane-associated protein and the green fluorescent protein, and 
mutants thereof 



Endosome fluorescence labeling 

In one embodiment, ligands that are transported into cells by receptor-mediated 
endocytosis are used to trace the dynamics of endosomal organeUes. Examples of 
labeled ligands include Bodipy FL-labeled low density Upoprotein complexes, 
tetramethybhodamine transferrin analogs, and fluorescently labeled epidermal growth 
factor (Molecular Probes, Inc.) 

In a second embodiment, fluorescently labeled primary or secondary antibodies 
(Sigma Chemical Co. St. Louis, MO; Molecular Probes, Inc. Eugene, OR; Caltag 
Antibody Co.) that specifically label endosomal ligands are used to marie the 
endosomal compartment in cells. 

In a third embodunent, endosomes are fluorescently labeled in cells expressing 
protein chimeras formed by fusing a green fluorescent protein, or mutants thereof, with 
a receptor whose intemaUzation labels endosomes. Chimeras of the EGF, transferrin, 
and low density lipoprotein receptors are examples of these molecules. 

Lysosome labeling 

In one embodiment, membrane permeant lysosome-specific luminescent 
reagents are used to label the lysosomal compartment of living and fixed cells. TTiese 
reagents include the luminescent molecules neutral red, N-(3-((2,4- 
dinitrophenyl)amino)propyl)-N-(3-aminopropyl)methylamme, and the LysoTracker 
probes which report intralysosomal pH as weU as the dynamic distribution of 
lysosomes (Molecular Probes, Inc.) 

In a second embodiment, antibodies against lysosomal antigens (Sigma 
Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
lysosomal components that are localized in specific lysosomal domains. Examples of 
these components are the degradative enzymes involved in cholesterol ester hydrolysis. 
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membrane protein proteases, and nucleases as well as the ATP-driven lysosomal proton 



pump. 

In a third embodiment, protein chimeras consisting of a lysosomal protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protem. or mutants thereof, are used to label the lysosomal domaia Examples of these 
components are the degradative enzymes involved in cholesterol ester hydrolysis 
membrane protein proteases, and nucleases as well as the AlT-diiven lysosomal proton 
pump. 



Cytoplasmic fluorescence labeling 

In one embodiment, cell permeant fluor^ent dyes (Molecular Probes Inc) 
with a reactive group are reacted with living cells. Reactive dyes including 
monobromobimane. 5-chloromethylfluorescein diacetate. carboxy fluorescein diacetate 
succmimidyl ester, and chloromethyl tetramethyhhodamine ar« examples of ceU 
permeant fluorescent dyes that are used for long term hAeUng of the cytoplasm of cells 
In a second embodiment, polar tracer molecules such as Lucifer yellow and 
cascade blue-based fluor^ent dyes (Molecular Probes. Inc.) ai^ introduced into cells 
usmg bulk loading methods and are also used for cytoplasmic labeling. 

In a third embodiment, antibodies against cytoplasmic components (Sigma 
Chemical Co.; Molecular Probes. Inc.; Caltag Antibody Co.) are used to fluor^cently 
label the cytoplasm. Examples of cytoplasmic antigens are many of the enzymes 
mvolved in intemiediary metaboUsm. Enolase. phosphofiuctokinase. and acetyl-CoA 
dehydrogenase are examples of uniformly distributed cytoplasmic antigens. 

In a fourth embodiment, protein chimeras consisting of a cytoplasmic protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protem. or mutants thereof, are used to label the cytoplasm. Fluorescent chimeras of 
umfomUy distributed proteins are used to label the entire cytoplasmic domain. 
Examples of these proteins are many of the p^teins involved in intermediary 
metabolism and include enolase. lactate dehydrogenase, and hexokinase. 

In a fifth embodiment, antibodies against cytoplasmic antigens (Sigma 
Chemical Co.; Molecular Probes. Inc.; Caltag Antibody Co.) are used to label 
cytoplasmic components that are localized in specific cytoplasmic sub^omains 
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Examples of these components are the cytoskeletal p„,teins actin. tubulin, and 
cytokeratm. A population of these proteins within cells is assembled into discrete 
structuies. which in this case, are fibrous. Fluorescence labeling of these proteins with 
antibody-based reagents therefore labels a specific sub-domain of the cytoplasm. 

hi a sixth embodiment, non-antibody-based fluorescently labeled molecules that 
mteiact strongly with cytoplasmic proteins ar^ used to label specific cytoplasmic 
components. One example is a fluorescent analog of the enzyme DNAse I (Molecular 
Probes. Inc.) Fluorescent analogs of this enzyme bind tightly and specifically to 
cytoplasmic actin. thus labehng a sub-domain of the cytoplasm. In another example 
fluorescent analogs of the mushroom toxin phalloidin or the drug paclitaxel (Moiecula^ 
Probes. Inc.) are used to label components of the actin- and microtubule-cytoskeletons. 
respectively. 

In a seventh embodiment, protein chimeras consisting of a cytoplasmic protein 
genetically fiised to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label specific domains of the cytoplasm 
Fluorescent chimeras of highly localized proteins are used to label cytoplasmic sub- 
domams. Examples of these proteins are many of the prx,teins involved in regulating 
the cytoskeleton. They include the structural proteins actin. tubulin, and cytokeratin as 
well as the regulatory proteins microtubule associated protein 4 and a-actinin. 

Nuclear labeling 

In one embodiment, membrane permeant nucleic-acid-specific luminescent 
reagents (Molecular Probes. Inc.) are used to label the nucleus of living and fixed cells. 
These reagents include cyanine-based dyes (e.g., TOTO®. YOYO®, and BOBO™) 
Phenanthidmes and acridines (e.^.. ethidium bromide, propidium iodide, and acridinJ 
orange), indoles and imidazoles ie.g., Hoechst 33258. Hoechst 33342 and 4' 6 
diamidino-2-phenyiindole). and other similar reagents (e.g.. 7-aminoactinomycin D. 
hydroxystilbamidine. and the psoralens). 

In a second embodiment, antibodies against nuclear antigens (Sigma Chemical 
Co.; Molecular Probes. Inc.; Caltag Antibody Co.) are used to label nuclear 
components that are localized in specific nuclear domains. Examples of these 
components are the macromolecules involved in maintaining DNA structure and 
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function. DNA, RNA, histones. DNA polymerase, RNA polymerase. lamins. and 
nuclear variants of cytoplasmic proteins such as actin are examples of nuclear antigens. 

In a third embodiment, protein chimeras consisting of a nuclear protein 
genetically fused to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the nuclear domain. Examples of these 
proteins are many of the proteins involved in maintaining DNA structure and function. 
Histones, DNA polymerase. RNA polymerase, lamins. and nuclear variants of 
cytoplasmic proteins such as actin are examples of nuclear proteins. 

Mitochondrial labeling 

In one embodiment, membrane permeant mitochondrial-specific Imninescent 
reagents (Molecular Probes. Inc.) are used to label the mitochondria of living and fixed 
cells. These reagents include rhodamine 123. tetramethyl rosamine. JC-1. and the 

MitoTracker reactive dyes. 

In a second embodiment, antibodies against mitochondrial antigens (Sigma 
Chemical Co.; Molecular Probes. Inc.; Caltag Antibody Co.) are used to label 
mitochondrial components that ar« localized in specific mitochondrial domains. 
Examples of these components are the macromolecules involved in maintaining 
mitochondrial DNA structure and function. DNA. RNA, histones. DNA polymerase. 
RNA polymerase, and mitochondrial variants of cytoplasmic macromolecules such as' 
mitochondrial tRNA and rRNA ar^ examples mitochondrial antigens. Other examples 
of mitochondrial antigens are the components of die oxidative phosphorylation system 
found in the mitochondria (e.g., cytochrome c. cytochrome c oxidase, and succinate 
dehydrogenase). 

In a third cmbodunent. protein chimeras consisting of a mitochondrial protein 
genetically fused to an intrinsically luminescent prx)tein such as the green fluorescent 
protein, or mutants thereof are used to label the mitochondrial domain. Examples of 
these components are the macromolecules involved in maintaming mitochondrial DNA 
structure and fimction. Examples include histones. DNA polymerase. RNA 
polymerase, and the components of the oxidative phosphorylation system found in the 
mitochondria (e.g., cytochrome c. cytochrome c oxidase, and succinate 
dehydrogenase). 
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Endoplasmic reticulum labeling 

In one embodiment, membrane permeant endoplasmic reticulum-specific 
luminescent reagents (Molecular Probes, Inc.) are used to label the endoplasmic 
reticulum of Uving and fixed cells. These reagents include short chain caibocyanine 
dyes (e.g., DiOCe and DiOCj), long chain caibocyanine dyes (e.g., DUCie and DilCig), 
and luminescently labeled lectins such as concanavalin A. 

In a second embodiment, antibodies against endoplasmic reticulum antigens 
(Sigma Chemical Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label 
endoplasmic reticulum components that are localized in specific endoplasmic reticulum 
domains. Examples of these components are the macromolecules involved in the fatty 
acid elongation systems, glucose-6-phosphatase, and HMG CoA-reductase. 

In a third embodiment, protein chimeras consisting of a endoplasmic reticulum 
protein genetically fused to an intrinsically luminescent protein such as the green 
fluorescent protein, or mutants thereof are used to label the endoplasmic reticulum 
domain. Examples of these components are the macromolecules involved in the fatty 
acid elongation systems, gIucose-6-phosphatase, and HMG CoA-reductase. 
Golgi labeling 

In one embodiment, membrane permeant Golgi-specific luminescent reagents 
(Molecular Probes, Inc.) are used to label the Golgi of living and fixed cells. These 
reagents include luminescently labeled macromolecules such as wheat germ agglutinin 
and Brefeldin A as well as luminescently labeled ceramide. 

In a second embodiment, antibodies against Golgi antigens (Sigma Chemical 
Co.; Molecular Probes, Inc.; Caltag Antibody Co.) are used to label Golgi components 
that are localized in specific Golgi domains. Examples of these components are N- 
acetylglucosamine phosphotransferase, Golgi-specific phosphodiesterase, and 
mannose-6-phosphate recqjtor protein. 

In a third embodiment, protein chimeras consisting of a Golgi protein 
genetically fiised to an intrinsically luminescent protein such as the green fluorescent 
protein, or mutants thereof, are used to label the Golgi domain. Examples of these 
components are N-acetylglucosamine phosphotransferase, Golgi-specific 
phosphodiesterase, and mannose-6-phosphate receptor protein. 
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While many of the examples presented involve the measurement of single 
cellular processes, this is again is intended for purposes of iUustration only. Multiple 
parameter high-content screens can be produced by combining several single parameter 
screens into a multiparameter high-content screen or by adding cellular parameters to 
any existing high-content screen. Furthennore. while each example is described as 
being based on either live or fixed cells, each high-contrait screen can be designed to be 
used with both live and fixed cells. 

Those skilled in the art will recognize a wide variety of distinct screens that can 
be developed based on the disclosure provided herein. There is a large and growing list 
of known biochemical and molecular processes in cells that involve translocations or 
reorganizations of specific components within cells. The signaling pathway from the 
cell surface to target sites within the cell involves the translocation of plasma 
membrane-associated proteins to the cytoplasm. For example, it is known that one of 
the src family of protein tyrosine kinases, pp60c-src (Walker et al (1993), J, Biol. 
Chem. 268:19552-19558) translocates from the plasma membrane to the cytoplasm 
upon stimulation of fibroblasts with platelet-derived growth factor (PDGF). 
Additionally, the targets for screening can themselves be converted into fluorescence- 
based reagents that report molecular changes including ligand-bmding and post- 
translocational modifications. 

Example 10. Protease Biosensors 
(1) Backgronnd 

As used herein, the following terms are defined as follows: 

• Eeactant - the parent biosensor that interacts with die proteolytic enzyme. 

• Product - the signal-containing proteolytic fi:agment(s) generated by the interaction 
of the reactant with the enzyme. 

• Reactant Tarpet Sequencp - an amino add sequence that imparts a restriction on the 
cellular distribution of the reactant to a particular subcellular domain of the cell. 

• Product Tarpet f^t^nmcf. - an amino acid sequence that imparts a restriction on the 
cellular distribution of the signal-containing product(s) of the targeted enzymatic 
reaction to a particular subceUular domain of the ceU. If the product is initially 
localized within a membrane bound compartment, then the Product Target 
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Sequence must incorporate the abiUty to export the product out of the membrane- 
bound compartment A bi-functional sequence can be used, which first moves the 
product out of the membrane-bound compartment, and then targets the pn>duct to 
the final compartment In general, the same amino acid sequences can act as either 
or both reactant target sequences and product target sequences. Exceptions to this 
include amino acid sequences which target the nuclear envelope. Golgi apparatus 
endoplasmic reticuulum. and which are involved in famesylation, which are mor^ 
suitable as reactant target sequaices. 
• Protease Recopnition Sit e - an amino acid sequence that imparts specificity by 
mmiicking the substrate, providing a specific binding and cleavage site for a 
protease. Although typically a short sequence of amino acids representing the 
minimal cleavage site for a protease (e.g. DEVD for caspase-3. Villa. P SH 
Kaufinann, and W.C. Eamshaw. 1997. Caspases and caspase inhibitors. Trends 
Biochem Sci. 22:388-93). greater specificity may be established by using a longer 
sequence fiiom an established substrate. 
' CsmEartment - any cellular sub-structure or maciomolecular component of the cell 
whether it is made of protein, lipid, carbohydrate, or nucleic acid. It could be a 
maciomolecuJar assembly or an o^ganeUe (a membrane delimited cellular 
component). Compartments include, but are not lunited to. cytoplasm, nucleus 
nucleolus, imier and outer surface of nuclear envelope, cytoskeleton. peroxisome 
endosome. lysosome, inner leaflet of plasma membrane, outer leaflet of plasma 
membrane, outer leaflet of mitochondrial membrane, inner leaflet of mitochondrial 
membrane. Golgi. endoplasmic reticulum, or extracellular space. 
Signal - an amino acid sequence that can be detected. This includes, but is not 
Umited to inherently fluorescent proteins (e.g. Green Fluorescent Protein), cofactor- 
requiring fluorescent or luminescent proteins (e.g. phycobUiproteins or luciferases) 
and epitopes recognizable by specific antibodies or other specific natural or 
unnatural binding probes, including but not limited to dyes, enzyme cofectors and 
engmeered binding molecules, which are fluorescently or luminescenUy labeled. 
Also included are site-specifically labeled proteins that contain a luminescent dye 
Methodology for site-specific labeling of protems includes, but is not limited to 
engineered dye-reactive amino acids (Post et al.. J. Biol. Chem. 269:12880-12887 
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(1994)), enzyme-based mcorporation of luminescent substrates into proteins 
(Buckler, et al.. Analyt. Biochem. 209:20-31 (1993); Takashi. Biochemistry. 
27:938-943 (1988)), and the incorporation of unnatural labeled amino acids into 
proteins (Noren. et al.. Science. 244:182-188 (1989)). 
• Detection - a means for recording the presence, position, or amount of the signal. 
The approach may be direct, if the signal is inherently fluorescent, or indirect, if, for 
example, the signal is an epitope that must be subsecpiently detected with a labeled 
antibody. Modes of detection mclude, but are not limited to, the spatial position of 
fluorescence, luminescence, or phosphorescence: (1) intensity; (2) polarization; (3) 
lifetime; (4) wavelength; (5) energy transfer, and (6) recovery after photobleaching. 
The basic principle of the protease biosensors of the present invention is to 
spatially separate the reactants from the products generated during a proteolytic 
reaction. The separation of products from reactants occurs upon proteolytic cleavage of 
the protease recognition site within the biosensor, allowing the products to bind to, 
15 difRise into, or be imported into compartments of the cell different from those of the 
reactant. This spatial separation provides a means of quantitating a proteolytic process 
directly in living or fixed ceUs. Some designs of the biosensor provide a means of 
restricting the reactant (uncleaved biosensor) to a particular compartment by a protein 
sequence ("reactant target sequence") that binds to or imports the biosensor into a 
20 compartment of the cell. These compartments include, but are not limited to any 
cellular substructure, macromolecular cellular component, membrane-limited 
organeUes, or the extracellular space. Given that the characteristics of the proteolytic 
reaction are related to product concentration divided by the reactant concentration, the 
spatial separation of products and reactants provides a means of uniquely quantitating 
products and reactants in single ceUs, allowing a more direct measure of proteolytic 
activity. 

The molecular-based biosensors may be introduced into cells via transfection 
and the expressed chimeric proteins analyzed in transient ceU populations or stable cell 
lines. They may also be pre-formed. for example by production in a prokaryotic or 
eukaryotic expression system, and the purified protein inteoduced into the cell via a 
number of physical mechanisms including, but not Umited to. micro-injection, scr^e 
loading, electiwporation. signal-sequence mediated loading, etc. 
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Measurement modes may include, but are not limited to. the ratio or difference 
in fluorescence, luminescence, or phosphorescence: (a) intensity; (b) polarization; or (c) 
Ufetime between reactant and product TTiese latter modes require appropriate 
spectroscopic differences between products and reactants. For example, cleaving a 
reactant containing a Umited-mobile signal into a very smaU translocating component 
and a relatively large non-translocating component may be detected by polarization. 
Alternatively, significantly different emission lifetimes between reactants and products 
allow detection in imaging and non-imaging modes. 

One example of a family of enzymes for which this biosensor can be 
constructed to report activity is the caspases. Caspases are a class of proteins that 
catalyze proteolytic cleavage of a wide variety of targets during apoptosis. Following 
initiation of apoptosis, the Class H "downstream" caspases are activated and are the 
point of no return in the pathway leading to cell death, resulting in cleavage of 
downstream target proteins. In specific examples, the biosensors described here were 
engineered to use nuclear translocation of cleaved GFP as a measurable indicator of 
caspase activation. Additionally, the use of specific recognition sequences that 
incorporate surrounding amino acids involved in secondary structure formation in 
naturally occmring proteins may mcrease the specificity and sensitivity of this class of 
biosensor. 

Another example of a protease class for which this biosensor can be constructed 
to report activity is zinc metalloproteases. Two specific examples of this class are the 
biological toxins derived fiom Clostridial species (C. botulinum and C tetani) and 
Bacillus anthracis. (Herreros et al. In The Comprehensive Sourcebook of Bacterial 
Protein Toxins. J.E. Alouf and J.H. Freer. Eds. 2"- edition, San Diego, Academic Press. 
1999; pp 202-228.) These bacteria express and secrete zinc metalloproteases that enter 
eukaryotic cells and specificaUy cleave distinct target proteins. For example, the 
anthrax protease from Bacillus anthracis is deUvered into the cytoplasm of target cells 
via an accessory pore-fonning protein, where its proteolytic activity inactivates the 
MAP-Idnase signaling cascade tirough cleavage of mitogen activated protein kinase 
kinases 1 or 2 (MEKl or MEK2). (Leppla, S.A. In The Comprehensive Sourcebook of 
Bacterial Protein Toxins. J.E. Alouf and m Freer, Eds. 2"" edition, San Diego, 
Academic Press, 1999; pp243-263.) Ihe toxin biosensors described here take 
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advantage of the natural subceUular localization of these and other target proteins to 
achieve reactant targeting. Upon cleavage, the signal (with or without a p«,duct target 
sequence) is sqjarated from the reactant to create a high-content biosensor. 

One of skill in the art wiU recognize that the protein biosensors of this aspect of 
the mvention can be adapted to report the activity of any member of the caspase family 
of proteases, as well as any other protease, by a substitution of the appropriate protease 
imjgmtion site in any of the constructs (see Figure 29B). These biosensors can be 
used in high-content screens to detect in vivo activation of enzymatic activity and to 
.dentily specific activity based on cleavage of a known recognition motif. TMs screen 
can be used for both live cell and fixed end-point assays, and can be combined with 
additional measurements to provide a multi-parameter assay. 

nius. in another aspect the present invention provides recombinant nucleic acids 
encoding a protease biosensor, comprising: 

a. a first nucleic acid sequence that encodes at least one detectable 
15 polypeptide signal; 

b. a second nucleic acid sequence that encodes at least one protease 
recognition site, wherein the second nucleic acid sequence is operatively linked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal- 
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and 

c 



a tiiiiti nucleic acid sequence that encodes at least one reactant target 
sequence, wherein the third nucleic acid sequence is operatively linked to the second 
nucleic acid sequence that encodes the at least one protease recognition site. 

In this aspect, the first and third nucleic acid sequences are separated by the 
second nucleic acid sequence, which encodes the protease recognition site. 

In a further embodiment, the recombinant nucleic acid encoding a protease 
biosensor comprises a fourth nucleic acid sequence that encodes at least one product 
ta,:get sequence, wherein the fourth nucleic acid sequence is operatively Unked to the 
first nucleic acid sequence that encodes the at least one detectable polypeptide signal. 

In a fimher embodiment, the recombinant nucleic acid encoding a protease 
biosensor comprises a fifth nucleic acid sequence that encodes at least one detectable 
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polypeptide signal, wherein the fifth nucleic acid sequence is opeiBtively linked to the 
third nucleic acid sequence that encodes the reactant target sequence. 

In a preferred embodiment, the detectable polypeptide signal is selected 6om 
the group consisting of fluorescent proteins, luminescent proteins, and sequence 
epitopes. In a most preferred embodiment, the first nucleic acid encoding a polypeptide 
sequence comprises a sequence selected from the group consisting of SEQ ID NOS- 35 
37, 39. 41. 43. 45. 47. 49. and 51. 

In another preferred embodiment, the second nucleic acid encoding a protease 
recognition site comprises a sequence selected fiom the group consisting of SEQ ID 
NOS: 53. 55. 57. 59. 61. 63. 65. 67. 69. 71. 73. 75. 77. 79. 81. 83. 85. 87. 89 91 93 
95. 97. 99. 101. 103. 105. 107. 109. 111. 113. 115. 117. 119. and 121. In'another 
preferred embodiment, the third nucleic acid encoding a reactant target sequence 
compnses a sequence selected fiom the group consisting of SEQ ID NOS: 123 125 
127. 129, 131. 133. 135. 137. 139. 141, 143. 145. 147. 149. and 151. ' ' 

In a most preferred embodiment, the recombinant nucleic acid encoding a 
protease biosensor comprises a sequence substantially similar to sequences selected 
fiom the group consisting of SEQ ID N0S:1. 3. 5. 7. 9. 11. 13. 15. 17. 19. 21 23 25 
27. 29, 31, and 33. ' ' * 

In another aspect, the present invention provides a recombinant expression 
vector comprising nucleic acid control sequences operatively linked to the above- 
described recombinant nucleic acids. In a still fiirther aspect, the present invention 
provides genetically engineered host cells that have been transfected with the 
recombinant expression vectors of the invention. 

In another aspect, the present invention provides recombimmt protease 
biosensors comprising 

a first domain comprising at least one detectable polypeptide 
a second domain comprising at least one protease recognition 



a. 

signal; 

b. 

site; and 

c. 



a third domain comprising at least one reactant target sequence; 
wherein the first domain and the third domain are separated by L 
second domain. 
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o<-™.g aiter acavaSo. (ie: p„,tea.e cleavage). This ,edWb„do„ do« m« n»„ta . 
complete sequesttaU™ „, ^ ^ ^ ^ ^^^^ ^ 

pamally overiap tte «ae,an. dia«W„„ in *e»e „f , 

(see below). ^ 

In a prefened embodiment, the recombinant protease biosensor further 
comprisesafourth domain comprising at least oneproduct target sequ^^^^^ 

and the fet domain are operatively linked and are s^^ 
tbrddomainbytheseconddomaia m another embodiment, the recombinant protease 
biosensor further comprises a fifth domain comprising at least one detectable 
polypeptide signal, wherein the fifth domain and the third domain are operatively 
hnked and are separated from the first domain by the second domain. 

^ ^'^'^^ embodiment, the detectable polypeptide signal domain (first or 
fifth domain) is selected ftom the group consisting of fluorescent proteins, luminescent 
pxotems. and sequence epitopes. In a most preferred embodiment, the detectable 
polypepude signal domain comprises a sequence selected fiom the group consisting of 
SEQ ID NOS:36. 38. 40. 42. 44. 46. 48. 50. and 52. 

In another preferred embodiment, the second domain comprising a p„,tease 
recogmfon site comprises a sequence selected ftom the group consisting of SEQ ID 
NOS:54. 56. 58. 60. 62. 64. 66. 68. 70. 72. 74. 76. 78. 80. 82. 84. 86. 88. 90. 92. 94 96 
98 100. 102. 104. 106. 108. 110. 112. 114. 116. U, 120. and 122. In an^th^ 
preferred embodiment, the reactant and/or target sequence domains comprise a 

sequence selected fiom the gn,up consisting of SEQ ID NOS:124. 126 128 130 132 
134. 136. 138, 140. 142. 144. 146. 148. 150, and 152. ' ' ' 

In a most preferred embodiment, the recombinant protease biosensor comprises 
a sequence substantially similar to sequences selected fiom the group consisting of 
SEQIDNO:2.4.6;8. 10. 12. 14. 16. 18. 20. 22. 24. 26. 28. 30. 32. and 34 

m a st.ll fimher embodiment, the present invention provides methods and kits 
for automated analysis of cells, comprising using cells that possess the protease 
biosensors of the invention to identify compounds that affect protease activity. Tt. 
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method can be combined with the other methods of the invention in a variety of 
possible multi-parametric assays. 

m these various embodiments, the basic protease biosensor is composed of 
multiple domains, mcluding at least a first detectable polypeptide signal domain at 
least one reactant target domam, and at least one protease recognition domain, wherem 
the detectable signal domain and the reactant target domain are separated by the 
protease recognition domain. Thus, the exact order of the domains in the molecule is 
not generaUy critical, so long as the protease recognition domain separates the reactant 
target and first detectable signal domain. For each domain, one or more one of the 
specified recognition sequaices is present 

In some cases, the order of die domains in the biosensor may be critical for 
appropriate targeting of productCs) and/or reactant to the appropriate cellular 
compartment(s). For example, the targeting of products or reactants to the peroxisome 
requires that the peroxisomal targeting domain comprise the last three amino acids of 
the protein. Determination of those biosensor in which the relative placement of 
targeting domains within the biosensor is critical can be detcmrined by one of skiU in 
the art through routine experim«aitatioa 

Some examples of the basic organization of domains within the protease 
biosensor are shown in Figure 30. One of skiU in the art wiU recognize that any one of 
a wide variety of protease recognition sites, product target sequences, polypeptide 
signals, and/or product target sequences can be used in various combinations in the 
protein biosensor of the present invention, by substituting the appropriate coding 
sequences into the multi-domain construct Non-Umiting examples of such alternative 
sequences are shown in Figure 29A-29C Similarly, one of skill in the art wiU 
recognize that modifications, substitutions, and deletions can be made to the coding 
sequences and the amino acid sequence of each individual domain within the biosensor 
while retaining the fimction of die domain. Such various combinations of domains and 
modifications, substitutions and deletions to mdividual domains are within the scope of 
the invention. 

As used herein, the term "coding sequence" or a sequence which "encodes" a 
particular polypeptide sequence, refers to a nucleic acid sequence which is transcribed 
(m the case of DNA) and translated (in the case of mRNA) into a polypeptide in vitro 
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or in vivo when placed under the control of appropriate regulatory sequences. Hie 
boundaries of the coding sequence are determined by a start codon at the 5' (amino) 
terminus and a translation stop codon at the 3' (carboxy) tenninus. A coding sequence 
can include, but is not limited to, cDNA fiom prokaryotic or eukaryotic mRNA. 
genomic DNA sequences fiom prokaryotic or eukaryotic DNA, and synthetic DNA 
sequences. A transcription temiination sequence will usually be located 3' to the coding 
sequence. 

As used herein, the term DNA "control sequences" refers collectively to 
promoter sequences, ribosome binding sites, polyadenylation signals, transcription 
termination sequences, upstream regulatory domains. enham:ers, and the Uke. which 
collectively provide for the transcription and translation of a coding sequence L a host 
cell. Not all of these control sequences need always be present in a recombinant vector 
so long as the DNA sequence of interest is capable of being transcribed and translated 
^propriately. 

As used herein, the term "operatively linked" refers to an arrangement of 
elements wherein the components so described are configured so as to perform their 
usual fimction. Thus, control sequences operatively linked to a coding sequence are 
capable of effecting the expression of the coding sequence. The control sequences need 
not be contiguous with the coding sequence, so long as they fimction to dir«:t the 
expression thereof Thus, for example, mtervening untranslated yet tt^cribed 
sequences can be present between a promoter sequence and the coding sequence and 
the promoter sequence can still be considered "operatively linked" to the coding 
sequence. 

Furthermore, a nucleic acid coding sequence is operatively linked to another 
nucleic acid coding sequences when the coding region for both nucleic acid molecules 
are capable of expression in the same reading frame. The nucleic acid sequences need 
not be contiguous, so long as they are capable of expression in the same reading frame. 
Thus, for example, intervening coding regions can be present between the specified 
nucleic acid coding sequences, and the specified nucleic acid coding regions can still be 
considered "operatively linked". 

The intervening codmg sequences between the varioui domains of the 
biosensors can be of any length so long as the fimction of each domain is retained. 
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Generally, this requires that the twonlimensional and three-dimensional structure of the 
intervening protein sequence does not preclude the binding or interaction requirements 
of the domains of the biosensor, such as product or reactant targeting, binding of the 
protease of interest to the biosensor, fluorescence or luminescence of the detectable 
polypeptide signal, or bmding of fluorescently labeled epitope-specific antibodies. 

One case where the distance between domains of the protease biosensor is 
unportant is where the goal is to create a fluorescence resonance energy transfer pair In 
this case, the FRET sigmd wiU only exist if the distance between the donor and 
acceptor is sufficiently small as to allow energy transfer (Tsien, Heim and Cubbit. WO 
97/28261). The average distance between the donor and acceptor moieties should be 
between 1 mn and 10 mn with a preference of between 1 mn and 6 mn. This is the 
physical distance between donor and acceptor. The intervening sequence length can 
vary considerably since the three dimensional structure of the peptide wiU determine 
the physical distance between donor and acceptor. 

"Recombinant expression vector" includes vectors that operatively link a 
nucleic acid coding region or gene to any promoter capable of effecting expression of 
the gene product. The promoter sequence used to drive expression of the protease 
biosensor may be constitutive (driven by any of a variety of promoters, including but 
not limited to. CMV. SV40. RSV, actin, EF) or inducible (driven by any of a number of 
inducible promoters including, but not limited to, tetracycline, ecdysone. steroid- 
responsive). The expression vector must be repUcable in the host organisms either as 
an episome or by integration into host chromosomal DNA. In a preferred embodiment, 
the expression vector comprises a plasmid. However, the invention is intended to 
include any other suitable expression vectors, such as viral vectors. 

The phrase "substantiaUy similar « is used herein in reference to the nucleotide 
sequence of DNA. or the amino acid sequence of protein, having one or more 
conservative or non-conservative variations ftom tiie protease biosensor sequences 
disclosed herein, including but not limited to deletions, additions, or substitutiom; 
wherein the resulting nucleic acid and/or amino acid sequence is functionally 
equivalent to the sequences disclosed and claimed herein. Fmictionally equivalent 
sequences will fimction in substantially the same mamier to produce substantially the 
same protease biosensor as the nucleic acid and amino acid compositions disclosed and 
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clauned herein. For example, functionally equivalent DNAs encode protease 
biosensors that are the same as those disclosed herein or that have one or more 
conservative amino acid variations, such as substitutions of non-polar indues for other 
non-polar residues or charged residues for similarly charged residues, or addition 
to/deletion fiom regions of the protease biosensor not critical for functionality These 
changes include those recognized by those of skill in the art as substitutions, deletions 
and/or additions that do not substantially alter the tertiary structure of the protein. 

As used herein, substantially similar sequences of nucleotides or amino acids 
share at least about 70%.75o/o identity, more preferably 80-85% identity, and most 
preferably 90-95% identity. It is recognized, however, that proteins (and DNA or 
mRNA encoding such proteins) containing less tiian the above-described level of 
homology (due to the degeneracy of the genetic code) or that ^ modified by 
conservative amino acid substitutions (or substitution of degenerate codons) ar^ 
contemplated to be within the scope of the present invention. 

The term "heterologous" as it relates to nucleic acid sequences such as coding 
sequences and control sequences, denotes sequences that are not normaUy associated 
with a region of a recombinant construct, and/or are not normally associated with a 
particular ceU. Thus, a "heterologous" region of a nucleic acid construct is an 
Identifiable segment of nucleic acid within or attached to another nucleic acid molecule 
that is not found in association with the other molecule in nature. For example, a 
heterologous region of a construct could include a coding sequence flanked by 
sequences not found in association with the coding sequence in nature. Another 
example of a heterologous coding sequence is a construct where the coding sequence 
Itself IS not found in nature (e.g., synthetic sequences having codons different from the 
native gene). Similarly, a host ceU transformed with a construct which is not normally 
present m the host cell would be considered heterologous for purposes of this invention. 

Within this application, miless otherwise stated, the techniques utilized may be 
found in any of several well-known references such as: Molecular Cloning: A 
Laboratory Manual (Sambrook, et al.. 1989. Cold Spring Harbor Laboratory Press) 
Gene Expression Technology (Methods in Enzymology, Vol. 185, edited by d' 
CSoeddel, 1991. Academic Press. San Diego, CA). "Guide to Protein Purification" in 
Methods in Enzymology (MJ. Deutshcer, ed.. (1990) Academic Press. Inc)- PCR 
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Protocols: A Guide to Methods and Applications (Innis, et al. 1990. Academic Press 
San Diego. CA). Culture of Animal Cells: A Manual of Basic Technique Ed (R./ 
Freshney. 1987. ^o±,m). Gene Transfer and Egression Protocols 

pp. 109-128, ed. EJ. Muixay. The Humana Press Inc.. Clifton, N.J.). and the Ambion 
1998 Catalog (Ambion, Austin, TX). 

nie biosensors of the present invention are constructed and used to Hansfect 
host cells using standard techniques in the molecular biological arts. Any number of 
such techniques, all of which are within the scope of this invention, can be used to 
generate protease biosensor-encoding DNA constructs and genetically transfected host 
cells expressing the biosensors. The non-limiting examples that follow demonstrate 
one such technique for constructing the biosensors of the inventioa 

EXAMPLE OF PROTEASE BIOSENSOR CONSTRUCTION AND USE: 

In the following examples, caspase-specific biosensors with specific product 
target sequences have been constructed using sets of 4 primers (2 sense and 2 
antjsense). These primers have overlap regions at their termini, and are used for PCR 
via a primer walking technique. (Sambrook. J., Fritsch, E.F. and Maniatis, T (1989 ) 
Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press Cold 
Spnng Harbor. New York) He two sense primers were chosen to start fiom the 5' 
polylinker (Bspl) of the GFP-containing vector (Clontech, California) to the middle of 
the designed biosensor sequence. The two antisense primers start ftom a 3' GFP vector 
site (Bam HI), and overlap with the sense primers by 12 nucleotides in the middle. 

PCR conditions were as follows: 94«'C for 30 seconds for denaturation, 55»C for 
30 seconds for anneaUng. and 72«'C for 30 seconds for extension for 15 cycles. THe 
primers have restriction endonuclease sites at both ends, facilitating subsequent cloning 
of the resulting PCR product 

The resulting PCR product was gel purified, cleaved at BspEl and BamHl 
restriction sites present in the. primers, and the resulting fragment was gel purified. 
Smularly. the GFP vector (Qontech. San Francisco. CA) was digested at BspEl and 
BamHl sites in the polylinker. Ligation of the GFP vector and the PCR product was 
perfomied using standard techniques at le^C overnight. E coll cells were tr^fected 
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on LB-agar with an appropriate antibiotic. 

Cells and transfections. For DNA tnuisfection, BHK cells and MCF-7 celb 
■> ---1^ to 50-700/0 confluence in 6 wen plates containing 3.1 of mini^ 
Eagle's mediun, (MEM) with 10% fetal calf serum. 1 uM I^glutamine. 50 ^^ral 
streptomycin. 50 ^g/m, penicillin. 0.1 mM non-essential amino acids. 1 mM sodium 
pymvate and 10 Mg^ of bovine insulin (for MCF-7 cell only) at 37 -c in a 5% CO, 
incubator for about 36 hours. The cells were washed with serum fiee MEM media and 
.ncubated for 5 hours with 1 ml of transfection mixture containing 1 ^g of the 
appropriate pla^nid and 4 ^g of lipofectimine (BRL) in the serum free MEM media. 
Subsequently, the transfection medium was removed and replaced with 3 ml of normal 
culture media. The transfected cells were maintained in growth medium for at least 16 
hours before performing selection of the stable cells based on standarxi molecular 
biology methods (Ausubel. et al 1 995). 

Apoptosis assay. For apoptosis assays, the cells (BHK, MCF-7) stably 
t^sfected with the appropriate protease biosensor expression vector were plated on 
tissue culture treated 96-well plates at 50-60o/. confluence and cultirred overnight at 
37-C. 5% CO,. Varying concentrations of cis-platin, staurosporine. or pacUtaxel in 
normal culture media were freshly pr^ai^d from stock and added to cell culture dishes 
to replace the old culture media Th. cells were then observed with the cell scr^g 
system of the present invention at the indicated time points either as live cell 
expermients or as fixed end-pomt experiments. 

1. Construction of S^omain protease biosensors 

WG^r""' " """"" ° ^'^''^^ 

The design of this biosensor is outlined in Figure 31, and its sequence is shown 
mSEQroNO:lahd2. 
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Primers for Caspase 3, Product target sequence = none (CP3GFP-CYTO): 

1) TCATCATCCGGAGCTGGAGCCGGAGCTGGCCGATCGGCTGTT 
AAA TCT GAA GGA AAG AGA AAG TGT GAC GAA GTT GAT GGA ATT 
GAT GAA GTA GCA (SEQ ID NO:153) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC 

CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC AGT AGA 
CAT AGT ACT TGC TAC TTC ATC (SEQ ID NO:154) 

3) TCA TCA TCC GGA GCT GGA (SEQ ID NO: 155) 

4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

This biosensor is restricted to the cytoplasm by the reactant target sequence. 
The reactant target sequence is the annexin n cytoskeletal binding domain 
(MSTVHEILCKLSLEGVHSTPPSA) (SEQ ID NO:124) (Figure 29C) (Eberhard et 
al. 1997. Mol. Biol. Cell 8:293a). The enzyme recognition site corresponds to two 
copies of the amino acid sequence DEVD (SEQ ID NO:60) (Figure 29B). which 
serves as the recognition site of caspase-3. Other examples with different numbers of 
protease recognition sites and/or additional amino acids fiom a naturally occurring 
protease recognition site are shown below. The signal domain is EGFP (SEQ ID 
NO:46) (Figure 29A) (Clontech, California). The parent biosensor (the reactant) is 
restricted to the cytoplasm by binding of the amiexin n domain to the cytoskeleton, and 
is therefore excluded from the nucleus. Upon cleavage of the protease recognition site 
by caspase 3, the signal domain (EGFP) is released fiom the reactant targeting domain 
(amiexin II), and is distributed throughout the whole volume of the cell, because it lacks 
any specific targeting sequence and is small enough to enter the nucleus passively 
(Fig 32) 

The biosensor response is measured by quantitating the effective cytoplasm-to- 
nuclear translocation of the signal (see above). Measurement of the response is by one 
of several modes, including integrated or average nuclear region intensity, the ratio or 
difFerence of die integrated or average cytoplasm intensity to integrated or average 
nuclear intensity. ITie nucleus is defined using a DNA-specific dye. such as Hoechst 
33342. 
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This biosensor pmvides ameasure of the proteolytic activity around the annexin 
n cytoskeleton binding sites within the ceU. Given the dispersed nature of the 
c^oskeleton and the effectively difiuse state of cytosolic enzymes, this provides an 
effective measure of the cytoplasm in general. 

Results & Discussion: 

Fig 32 illustrates images before and after stimulation of apoptosis by cis-platin 
m BHK cells, transfected with the caspase 3 biosensor. Hxe images clearly illustrate 
accumulation of fluorescence in the nucleus. Generation of the spatial change in 
fluor^cence is non-reversible and thus the timing of the assay is flexible. Controls for 
this biosensor include using a version in which the caspase-3-specific site has been 
omrtted. In addition, disruption of the cytoskeleton with subsequent cell rounding did 
not produce the change in fluorescence distribution. Our experiments demonstrate the 
correlation of nuclear condensation with activation of caspase activity. We have also 
tested this biosensor in MCF-7 cells. A recent report measured a peak response in 
caspase-3 activity 6 h after stimulation of MCF-7 cells with etoposide accompanied by 
cleavage of PARP (Benjamin et aL mSMol Pharmacol. 53:446-50). However 
another recent report found that MCF-7 cells do not possess caspase-3 activity and, in' 
fact, the caspase-3 gene is functionally deleted (Janicke et al. 1998. J Biol Chem 
273:9357-60). Ca^ase-3 activity was not detected with the caspase biosensor in MCF- 
7 cells after a 15 h treatment with 100 ^iM etoposide. 

Janicke et al.. (1998) also indicated that many of the conventional substrates of 
caspase-3 were cleaved in MCF-7 cells upon treatment with staurx.sporine Our 
expermients demonstrate that caspase activity can be measured using the biosensor in 
MCF-7 cells when treated with staurosporine. The maximum magnitude of the 
actrvation by staurosporine was ^proximately one-half that demonstrated with cis- 
Platm in BHK cells. This also implies that the cmrent biosensor, although designed to 
be caspase-3-specific. is indeed specific for a class of caspases rather than uniquely 
specific for caspase-3. Hie most likely candidate is caspase-7 (Janicke et al. 1998) 
These experiments also demonstrated that the biosensor can be used in multiparameter 
expermients, with the correlation of decreases in mitochondrial membrane potential, 
nuclear condensation, and caspase activation. 
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We have specificaUy tested the effects of paclitaxel on caspase activation using 
the biosensor. Caspase activity in BHK and MCF-7 cells was stimulated by paclitaxel. 
It also appears that caspase activation occurred after nuclear morphology changes. One 
caveat is that, based on the above discussions, the caspase activity reported by the 
biosensor in this assay is likely to be due to the combination of caspase-3 and, at least, 
caspase-7 activity. 

Consistent with the above results using stamx)sporine stimulation on MCF-7 
cells, paclitaxel also stimulated the activation of caspase activity. The magnitude was 
similar to that of staurosporine. This experiment used a much narrower range of 
paclitaxel than previous experiments where nuclear condensation appears to dominate 
the response. 

b. Caspase biosensor with the microtubule associated protein 4 
(MAP4) projection domain (CP8GFPNLS-SIZEPRCXI) 

Another approach for restricting the reactant to the cytoplasm is to make the 
biosensor too large to penetrate the nuclear pores Cleavage of such a biosensor 
liberates a product capable of diffusing into the nucleus. 

The additional size required for this biosensor is provided by using the 
projection domain of MAP4 (SEQ ID NO:142) (Figure 29C) (CP8GFPNLS- 
SIZEPROJ). The projection domain of MAP4 does not interact with micrx)tubules on 
its own, and, when expressed, is difiusely distributed throughout the cytoplasm, but is 
excluded from the nucleus due to its size (-120 kD). Thus, this biosensor is distinct 
from the one using the foil length MAP4 sequence, (see below) One of sWU in the art 
will recognize that many other such domains could be substituted for the MAP4 
projection domain, including but not limited to multiple copies of any GFP or one or 
more copies of any other protein that lacks an active NLS and exceeds the maximmn 
size for diflfiision into the nucleus (approximately 60 kD; Alberts, B., Br^y, D.. Ra^ 
M., Roberts, K.. Watson, J.D. (Eds.) Molecular Rinlnf^.f^K. r-,.. e^ition^ New 
Yoric: Garland publishing. 1994. pp 561-563). TTie complete sequence of the resisting 
biosensor is shown in SEQ ID NO: 3-4. A similar biosensor with a differ^it protease 
recognition domain is shown in SEQ ID NO:5-6. 
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c Caspase biosensor with a nudear export signal 
Another approach for restricting the r^tant to the cytoplasm is to actively 
-tnct the rcactant fiom the nucleus by using a nuclear export signal. Cleavage of 
^ch a biosensor Uberates a product capable of diffusing into the nucleus. 

The Bacillus antbacis bacterium expresses a zinc metalloprotease protein 
complex called anthrax protease. Human mitogen activated protein kinase 1 
(MEK 1) (Seger et al.. J. Biol. Chem. 267:25628-25631, 1992) possesses an anthrax 
pn^tease recognition site (amino acids 1-13) (SEQ ID NO:102) (Figure 29B) that is 
d^ved after amino acid 8. as weD as a nuclear export signal at amino acids 32^ 
SEQ ID NO:140^ ^^r. 29C). Human MEK 2 (Zheng and Guan. . Biol. Chem. 
268:11435-11439. 1993) possesses an anthrax p^tease ignition site comprising 
am.no acdresidues 1-16 (SEQ « NO: 104) (Figure 29B) and a nuclear export signal 
at ammo acids 36-48. (SEQ ID NO:148) (Figure 29Q. 

The anthrax protease biosensor comprises Fret25 (SEQ ID NO:48) (Figure 
29A) as the signal, the anthrax protease recognition site, and the nuclear export signal 

from MEK 1 or MEK2. (SEQ mNOS: 7-8 (MEKl); 9-10 (MEK2)) lie intact 
biosensor will be retained in the cytoplasm by virture of this nuclear export signal (eg 
reac^t target site, Upon Ceavage of the fusion protein by anthrax protease, the' 
NES w,Il be separated from the GFP aUowing the GFP to difiuse into the nucleus. 

2. Construction of 4- and 5-domain biosensors 

For all of the examples presented above for 3.domain protease biosensors, a 
product targeting sequence, including but not limited to those in Figure 29C. such as a 
nuclear localization sequence (NLS). can be operatively Unked to the signal sequence 
and thus cause the signal sequence to segregate from the reactant target domain afte^ 
pn,teolytic cleavage. Addition of a second detectable signal domain, including but not 
hnuted to those in Figure 29A. operatively linked with the reactant target domain is 
also useful in allowing measurement of the reaction by multiple means. Specific 
examples of such biosensors are presented below. 

a- 4 domain biosensors 

1. Caspase biosensors with nuclear localization sequences 
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(pcas3nlsGFP; CP3GFPNLS-CYTO): 

The design of the biosensor is outlined in Figure 33. and its sequence is shown 
m SEQ n> NO:,M2. PCR and cloning p.cedu^ .ere perfonned as described 
above, except that the following oligonucleotides were used: 
5 Primers for Caspase 3. Product target sequence = NLS (CP3GFPNLS-CYTO) : 

1) TCATCATCCGGAAGAAGGAAACGACAAAAG CGATCGGCT 
GTT AAA TCT GAA GGA AAG AGA AAG TGT GAC GAA GIT GAT GGA 
ATT GAT GAA GTA GCA (SEQ ID NO:157) 

> 2)GAAGAAGGATCCGGCACTTGGGGGTGTAGAATGAACACC 
CTC CAA GCr GAG CTT GCA CAG GAT TTC GTG GAC AGT AGA 
CAT AGT ACT TGC TAC TTC ATC (SEQ ID NO:154) 
3 ) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 
4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

TOs biosensor is similar to that shown in SEQ ID NO:2 except upon 
recogmtion and cleavage of the protease recognition site, the product is released and the 
signal accumulates specifically in the nucleus due to the presence of a nuclear 
locahzation sequence. RRKRQK (SEQ ID NO:128) (Figure 29C)(Briggs et al J 
B.OL Chem. 273:22745. 1998) attached to the signal. A specific benefit of Ls 
construct is that the products are clearly sepa«ted fiom the reactants. TT,e reactants 
-mam m the cytoplasm, whUe the product of the enzymatic reaction is restricted to the 
nuclear compartment. response is measure, by quantitating the effective 
cytoplasm-to-nuclear translocation of the signal, as described above. 

With the presence of both product and reactant targeting sequences in the parent 
biosensor, the reactant target sequence should be dominant prior to activation (eg 
protease cleavage) of the biosensor. One way to accomplish this is by masking the' 
product targeting sequence in the parent biosensor until after protease cleavage m one 
such example, the p„,duct target sequence is functional only when relatively near the 
end Of a polypeptide chain (ie: after protease cleavage). Alternatively. ti,e biosensor 
may be designed so that its tertiary structure masks the fimction of the target sequence 
until after protease cleavage. Both of these approaches include comparing targeting 
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sequences with different relative strengths for tai^eting. Using the example of the 
nuclear localization sequence (NLS) and annexin E sequences, different strengths of 
NLS have been tried with clone selection based on cytoplasmic restriction of the parent 
biosensor. Upon activation, the product targeting sequence wiU naturally dominate the 
localization of its associated detectable sequence domain because it is then separated 
fiom the reactant tai;geting sequence. 

An added benefit of using this biosensor is that the product is tai^eted, and thus 
concentrated, into a smaller region of the cell. Thus, smaller amounts of product are 
detectable due to the increased concentration of the product TOs concentration effect 
is relatively insensitive to the cellular concentration of the reactant Hie signal-to-noise 
ratio (SNR) of such a measurement is unproved over the more dispersed distribution of 
biosensor #1. 

Similar biosensors that incorporate either the caspase 6 (SEQ ID NO:66) 
(Figure 29B) or the caspase 8 protease recognition sequence (SEQ ID NO:74) (Figure 
29B) can be made using the methods described above, but using die following primer 
sets: 

Primers for Caspase 6, Product target sequence = NLS (CP6GFPNLS- 
CYTO) 

1) TCATCATCCGGAAGAAGGAAACGACAAAAGCGATCG 
ACA AGA err GTT GAA ATT GAC AAC (SEQ ID NO:159) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC 
ACC CTC CAA GCT GAG CTT GCA CAG GAT TTC GTG GAC 
AGT AGA CAT AGT ACT GTT GTC AAT TTC (SEQ ID NO:160) 

3) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 

4) GAA GAA GGA TCC GGC ACT (SEQ ID NO:156) 

Primers for Caspase 8, Product target sequence = NLS (CP8GFPNLS-CYTO) 

1) TCATCATCCGGAAGAAGGAAACGACAAAAGCGATCG 
TAT CAA AAA GGA ATA CCA GTT GAA ACA GAC AGC GAA GAG 
CAA CCT TAT (SEQ ID NO:161) 

2) GAA GAA GGA TCC GGC ACT TGG GGG TGT AGA ATG AAC ACC CTC 
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CAA GCr GAG CTT GCA CAG GAT TTC GTG GAC AGT AGA CAT AGT 
ACT ATA AGG TTG CTC (SEQ ro NO:162) 

3) TCA TCA TCC GGA AGA AGG (SEQ ID NO:158) 

4) GAAGAAGGATCCGGCACT(SEQIDNO:156) 

The sequence of the resulting biosensors is shown in SEQ ID NO-13 14 
(Caspase 6) and SEQ ID NO: 15-16 (Caspase 8). Furthermoi., multiple copies of the 
protease recognition sites can be inserted into the biosensor, yielding the biosensors 
shown in SEQ ID NO: 17-18 (Caspase 3) and SEQ ID NO:19.20 (Caspase 8). 

2. Caspase 3 biosensor with a second signal domain 

An alternative embodiment employs a second signal domain operatively 
hnked to the reactant target domain. In this example, full length MAP4 serves as the 
reactant target sequence. Upon recognition and cleavage, one product of the reaction, 
contaming the reactant target sequence, remains bound to microtubules in the 
cytoplasm with its own unique signal, while the other product, containing the product 
target sequence, difiuses into the nucleus. This biosensor provides a means to measur. 
two actmties at once: caspase 3 activity using a translocation of GFP into the nucleus 
and microtubule cytoskeleton mtegrity in response to signaling cascades initiated 
during apoptosis, monitored by the MAP4 reactant target sequence. 

The basic premise for this biosensor is that the reactant is tethered to the 
microtubule cytoskeleton by virtue of the reactant target sequence comprising the firll 
length microtubule associated protein MAP4 (SEQ D) NO:152) (Figure 29C) In this 
case, a DEVD (SEQ ID NO:60) (Figure 29B) r«.,gnition motif is located betwe«r the 
EYFP signal (SEQ ID NO:44) (Figure 29A) operatively linked to the reactant target 
sequence, as well as the EBFP signal (SEQ ID NO:48) (Fignre 29A) operatively 
Imked to the C-terminus of MAP4. The r«ulting biosensor is shown in SEQ ID 
NO:21-22. ^ 

This biosensor can also include a product targeting domain, such as an NLS 
operatively linked to the signal domain. 

With this biosensor, caspase-3 cleavage still releases the N-terminal GFP which 
undergoes translocation to the nucleus (directed there by the NLS). Also. the'MAP4 
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fbgment. which is still intact following proteolysis by caspase-3. continues to report on 
the integdty of the microtubule cytoskeleton during the process of apoptosis via the 
second GFP molecule iused to the C-terminus of the biosensor. TTierefore. this single 
chimeric protein allows simultaneous analysis of caspas^3 activity and the 
polymerization state of the microtubule cytoskeleton during apoptosis induced by a 
variety of agents. This biosensor is also useful for analysis of potential drug candidates 
that specifically target the microtubule cytoskeleton. since one can determine whether a 
particular drag induced apoptosis in addition to afTecting microtiibules. 

This biosensor potentiaUy combines a unique signal for the reactant, 
fluorescence resonance energy transfer (FRET) fiom signal 2 to signal I. and a unique 
signal localization for the product, nuclear accmnulation of signal 1. The amount of 
product generated will also be indicated by the magnitude of the loss in FRET, but this 
will be a smaller SNR than the combination of FRET detection of reactant and spatial 
localization of the product. 

FRET can occur when tiie emission spectiimi of a donor overlaps significantly 
the absorption specdimi of an accqjtor molecule, (dos Remedios. C.G.. and P.D. 
Moens. 1995. Fluorescence resonance energy transfer spectroscopy is a rehable "mler" 
for measuring stractural changes in proteins. Dispelling the problem of the mdcnown 
orientation factor. J Struct Biol. 1 15:175-85; EmmanouiUdou, E., A.G. Teschemacher 
A.E. Pouli, L.I. NichoUs, E.P. Seward, and G.A. Rutter. 1999. Imaging Ca(2+) 
concentration changes at the secretory vesicle surface witi, a recombinant tai^eted 
cameleon. CurrBiol. 9:915-918.) The average physical distance between the donor and 
acceptor molecules should be between 1 mnand 10 mn with a preference of between 1 
nm and 6 mn. The intervening sequence length can vary considerably since the tiuee 
(bmensional structure of the peptide will determine the physical distance between donor 
and acceptor. This FRET signal can be measured as (1) the amount of quenching of the 
donor in tire presence of the acceptor. (2) the amount of acceptor emission when 
exciting tire donor; and/or (3) the ratio between the donor and acceptor emission. 
Alternatively, fluorescent lifetimes of donor and acceptor could be measured 

niis case adds value to the above FRET biosensor by nature of the existence of 
the reactant targeting sequence. This sequence aUows the placement of the biosensor 
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into specific compa«mcn.s of *c cell for . ^ ^„ ^ ^ 

convartmenls such as the taer sur&ce of Ihe plasm, membrane. 

Tl«'=)^<'P'^««»nd signal „=pr«en,,b„4origtaal^^^ 
"»''~'»*-n»-l-rfimsig„alrep,««sanotep^„«„f^.^„^^^ 
c^nc ^cdo. has a. added flerihili., in *a. i. can be ^ed as (1, nuclear 
•n^V. (2) fte nucleus /cyu^hsm raUo; (3) U,e m^leus /cytoplasm FRET raUo- (4, 
cytoplasmic /cytoplaiiinic FRET ratio. 

He p»se« FRET biosensor design diff«s 6om preW™« FRET-ba^, 
b„ (see WO »7/2826,: W09837226, in tlu, i. signal measur^en. is based on 

spana. posita raUter d^ intensity. ,*e products of d.e .eacdon are se^gated torn 
the reactants. It is this change in spatial position that is measured. He FRET-based 
btosensor is based on the sepamtion. but not to another comparbnen, of a donor and 
acceptor pair. He intensity change is due to the physical sepatation of the donor and 
acceptor upon pn.tcoly«c cleavage. disadvantages of FRET-bascd biosensors are 
(1) .he SNR » rather low and difflcul, to measure. (2) a« signal is not acable. It must 
be recorded using living cells. Chemical fixation, for example with fcnnaldehyde 
camtot preserve both the parent and resultant signal; (3) the range of „avelengd« are' 
hnutmg and cover a larger range of the spectrum due to the pr«e.« of two 
fluon^bores or a fiuotophore and chromoph^; (4) the construction has gn^ter 
mntanons in dta. the donor and ^tor ^ be precisely arranged .0 «,sure that the 
distance falls within 1-10 nm. 

Benefits of the positional biosensor inclute: (1) ability to concentrate the 
signal m otder to achieve a higher SNK. (2) abUity to be used with either Uving or fixed 
cdls; (3) only a single fluoresce,, signal is ne«,.d; (4) the atrangement of the domains 
of tte btosensor is mo,, flexible, only limiting 6ctor h, flte appUcaion of th. 
postbonal biosensor is the need to define the sp^al position of the signal which 
«,unes an imaging method with sufficient spafial resototion to resolve th. difeenc 
between the reactant compartment and the ptoduc companment 

O-of*" in an will nxogmzeU^t this approach can be adapt«i to n=po,. 
any desued combination of activides by simply making the appropriate substitutions 
for ft. protease recognition s«,„ence a«l the reaotan, ,a,g« sequntce, including but 
not tomted to those siiquences shown in Hgnre 29A-C 
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'^^^"•^ « "o«.«.r ^ 3 ..cM.r locB^.. (CP8GFPNUO 

" I'CR product were ligated as described above. 

Primers for Caspase 8. Nucleolar localization signal (CP8GFPNUC-CYTO): 

1) TCA TCA TCC GGA AGA AAA CGT ATA CGT ACT TAG CTC AAG 
TCC TGC AGG CGG ATG AAA AGA (SEQIDNO I63) 

C^cT AGA 
CAT CTC AAA ACC ACT TCT TIT CAT (SEQIDNO:164) 

3) TCA TCA TCC GGA AGA AAA (SEQ ID NO: 165) 

4) GAAGAA CGA TCG AGT AAG (SEQ ID NO: 166) 

sequence of the resulting biosensor is shown in SEQ ID NO: 23-24 TTais 
b.osen«,r includes the protease recognition site for caspase-8 (SEQ ID N0.74) 

ZTnT ^""^^"^^'"^^^P-'^-erecognitionsitefor caspas J 
(SEQIDNO:25-2(S) p-ae 

HK»e biases could be used ™u, other bios««o,s ta. p«».ss o„ same 

^ .gna. 00,0, «^ a. .a,,e.ed .0 sepa^e co^a^n,,^ ^ ^ „3,3^ 

CYTO. Tbo pnxto, Of each biosensor re«:«o„ ^ be ,^^y „eaa»ed due u, 

2^ Of .he pro^ b,«d 0. p,od«a .a^ng ,e,„e««. Boft p„rf„c.s 

from CPSOFPNUC-CYTO and CP3GFPNLSCvm«. i., . "I™*"*' 
,, '^■'^'™'^CYTO are separable due to the different 

^ poatnons. nucleus ™. nudeota^ .,e„ u-ough fte color, of the pn»,uc.s an, 

.he ^. Assessing the non™e.„,ar, nuoiear region in orde, to avoid the 
spahal overlap of *e t«. signals would perfom, ^ measurement of CP3GFPN1S in 
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the presence of CP8GFPNUC. The loss of the nucleolar region from the nuclear signal 
IS insignificant and does not significantly affect the SNR. The principle of assessing 
multiple parameters using the same product color significantly expands the number of 
parameters that can be assessed simultaneously m living cells. This concept can be 
extended to other non-overlapping product target compartments. 

Measur«nent of translocation to the nucleolar compartment is performed by (I) 
defining a mask corresponding to the nucleolus based on a nucleolus-specific marker 
including but not limited to an antibody to nucleolin (Lischwe et al.. 1981. £^ Cell 
Res. 136:101-109); (2) defining a mask for the reactant target compartment, and (3) 
detemuning the relative distribution of the signal between these two compartments 
This relative distribution could be represented by the difference in the two intensities 
or, preferably, the ratio of the intensities between compartments. 

The combination of multiple positional biosensors can be complicated if the 
reactant compartments are overiapping. Although each signal could be measured by 
simply detemuning the amount of signal in each product target compartment, higher 
SNR will be possible if each reactant is uniquely identified and quantitated. This higher 
SNR can be maximized by adding a second signal domain of contrasting fluorescent 
property. This second signal may be produced by a signal domain operatively linked to 
the product targeting sequence, or by FRET (see above), or by a reactant targeting 
sequence uniquely identifying it within the reactant compartment based on color, 
spatial position, or fluorescent property including but not limited to polarization or 
lifetime. Alternatively, for large compartments, such as the cytoplasm, it is possible to 
place different, same colored bios«,sors in different parts of the same compartment 

4. Protease biosensors with multiple copies of a second signal domain serving 
as a reactant target domain 

In another example. (CPSYFPNLS-SIZECFPn) increasing the size of the 
reactant is accomplished by using multiple inserts of a second signal sequence for 
example. ECFP (SEQ ID NO:50) (Figure 29A) (Tsien. R.Y. 1998. Amiu Rev 
B.ochem. 67:509-44). Thus, the multiple copies of the second signal sequence serve as 
the reactant target domain by excluding the ability of the biosensor to difiuse into the 
nucleus. This type of biosensor provides the added benefit of additional signal being 

no 
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avaiUbfe per >,io^ ^^^^^^ ^ 

„ „„i,„e signal, being .«ife„d, ^ ^ ^ 
formation, elc TWs could provide a m,„e dgral to fte .eactate. 

' domJ" ''«"°^'""'"'"» >>'»»«««r wM, W.».„embmne tenjeUng 

In an alten,aflve embodiment a .rans-membrane argeUng se,^enee ia naed «, 
U,e reaaa.^ to cytopiasmie .eaieiea. ^ an altaudve pn,toaae recogmta aito 
.s used. Tie toWbo<nlinnm biosensor (SEQ n> NOS=27.2. (eellubrevin); 29-30 
(aynaptobre™,) eonaisB of an NLS (SEQ ID NO:m) (Fig.,. 29Q. Fr«25 sign., 
domain (SEQ m NO=52, (Ffenre 2M), a toUnua or bonriinnm meMopro,^ 
-ogmuon sito a™, cellubre™ (SEQ ID NO:106) (Fig.„ 2,B, (MoMalron e, al 
Namre 364:346-349. 1993; Mar^n e. al.. CeU BioL. in press, or synaptobrevin (SEQ 
ID NO=I0«) 0^ 2,B, (GenBank Accession W64520,. and a nans-membrje 
s^uence 6om ecllubrevin (SEQ ID NO:,46) (Fig„. 29C) or synaptobrevin (SEQ ID 
NO=,44, (Rg.„ 29C, a. 3-^ „Mcb ^ ^ biosensor to eellnlar vesicles 
Tire N-ttnninns of eacb pn,.ein is crfemed towards .he cytoplasm. In to intei 
brosensor. GFP is todrered to to vesicles, ^.cn cleavage by to totanus or bomlinnm 
^nc meulloprotosse. GFP „i„ „„ tonger be assodated wid, to vesicle and is See to 
difltae Ihronghoul to cytoplasm and to nucieus. 

b. 5-domain biosensors 

1. Caspasc 3 biosensor with a nuclear localization domain and a 
second signal domain operatively linked to an annexin U domain 

biosensor is outlined in FIgnre 35, and the sequence 
shown in SEQ ID NO:33.34. This biosensor differ, fiom SEQ ID NO 11-12 by 
mcluding a second detectable signal. ECFP (SEQ n> NO:50) (Figure 29A) (signal 2) 
operatively linked to the reactant target sequence. 

2. Caspase 3 biosensor with a nuclear localization sequence and a 
(CPSYFPNLS-CFPCYTO) 
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to bios^sor (SEQ n. NO:31-32,. a. Nts pnxiuc. .^etag donuin (SEQ 

NO:44) (FIg.„ 29A). A DBVD protease ,«»g,,u„„ (gEQ m NO-«» 

(Hg.re 29B) « b«wecn ater to EYFP .goal domain and befcre to MAP4 
5 Piojaaon domain (SEQ ID NO:14J)(Figiii»2!>C). 

Eiamplell. HuorMcent Biosensor Toiin Charaolerization 

As used herein, W-„fte.o any „,ganism.ma<™noloc^o, or organic or 
«.<^c molecule or ion to. .l,«s nonnal physiological pmcesses f„™,d »*bin . 
cell, or any orgamsm, macromolecnle, or orgamc or inorganic molecnl. or ion 0„. 
a«e„ to physiological response to modulalon, of known physiological processes 
Tl^s, a „,in can mintfc a normal cell sdmUus, or can alttr . response to a n^mal cell 
Stimulus. 

Living cells are to targeb of taic ag«„s to. can compd^ orgamsms 
macomolecules. or organic or inorganic molecules. A cell-based approach » toxin' 
dcecnon. classiflcatton, and idendficaflon «uld exploi. to sensiSve and specific 
molecular dCecSon and ampUacadon systtm d^eloped by cells to sense minute 
changes in toir e«e™al milieu. By combining to evolved sensing capabiU.y of cells 
wth to lumiuescen. reporter molecules and assays descibed herein, indaceUular 
molecUar and chemical eveMs caused by ,oxic agenB can be converted into detecteble 
spatial and temporal luminescent signals. 

When a toxin interacts with a cell, whether it is at the cell surface or within a 
^ecific intracellular compartment, the toxin invariably undermines one or more 
components of the molecular pathways active within the cell. Because the ceU is 
compnsed of complex networks of intercomrected molecular pathways, the effects of a 
toxm will Ukely b. transmitted throughout many cellular pathways. Herefore. our 
strategy ,s to use molecular markers within key pathways likely to be affected by 
toons, including but not limited to ceU stress pathways, metabolic pathways, signaling 
pathways, and growth and division pathways. 

We have developed and characterized three classes of cell based luminescent 
y er molecules to serve as reporters of toxic threat agents. n,ese 3 classes ar. as 
follows: 
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(1) Detectors: general cell stress detection of a toxin; 

(2) Classifiers: perturbation of key molecular pa&way(s) for detection and 

Classification of a toxin; and 

(3) Ideroifiers: activity mediated detection and identification of a toxin or a 

5 group of toxins. 

Thus, in another aspect of the present invention, living cells are used as 
biosensors to interrogate the enviromnent for the presence of toxic agents, m one 
embodmient of this aspect, an automated method for cell based toxin characterization is 
disclosed that comprises providing an array of locations containing cells to be treated 
w.th a test substance, wherein the cells possess at least a first luminescent r^orter 
molecule comprising a detector and a second luminescent reporter molecule selected 
fiom the greup consisting of a classifier or an identifier, contactmg the cells with the 
test substance either before or after po^ession of the first and second luminescent 
reporter molecules by the cells; imaging or scanning multiple cells in each of the 
locations containing multiple cells to obtain luminescent sign^ fi,„ the detector 
convertmg the luminescent signals fiom the detector into digital data to automatically 
measure changes in the localization, distribution, or activity of the detector on or in the 
cell, which indicates the presence of a toxin in the test substance; selectively imaging or 
scanmng the locations contaimng cells that were contacted with test sample indicated to 
have a toxin in it to obtain luminescent signals fix>m the second reporter molecule- 
converting the luminescent signals fiom the second luminescent reporter molecule intci 
digital data to automatically measure changes in the localization. distribuUon. or 
activity of the classifier or identifier on or in the cell, wherein a change in the 
localization, distribution, structure or activity of the classifier identifies a cell pathway 
that IS perturt,ed by the toxin present in the test substance, or wherein a change in the 
localization, distribution, structure or activity of the identifier identifies the specific 
toxm that is present in the test substance. In a preferred embodiment, the cells possess 
at least a detector, a classifier, and an identifier, m a fiuther preferred embodiment, the 
digital data derived fiom the classifier is used to det^mine which identifier(s) to 
employ for identifying the specific toxin or group of toxins. 

As used herein, the phrase "the cells possess one or more luminescent reporter 
molecules" means that the luminescent reporter molecule may be expressed as a 
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celleilherbefbreorafterlreamentmlhthetKfsubstace. 

also b TlTT^ """^ ""^""^ -nay 

n«« ^^^^ ^ 

user 0^ ^ ^ ^ ^ .l^^s^ 

or .den^aer ,ype to y« a,„d,er population of celb id^Scal to or difl™. ft, 

cells containing the toxin detector. 

11.= de.e«or. classifier, and id«>«fier can con,pHse s»„. reporter motenle 
or they can comprise different reporters. 

^'=^ -stages in U.. localizaSon. dis«bn«on. slmcnne or aedvity of 
<^ d^ectors, eUssifier. and/or idenMe. can *e carried o„, in eim„ a Ugh 
^roughpn. or a high conren. „od. 1. gene.^ , ^ ^ ^ 

«. a hgh-tongbpn, assay if d,e spatial tafonnation .e„de,ed by fte Mgh^n«a„ assay 
can be re^Hled in such a way as ti, no longer recpirc optical spatial r«»lution on 
cellular or ^bcellnla, levels. For ^.ample, a H^^,en. assay for ™ic„n*ule 

morganr^uo. can be canicd ou, by optically resolvtog tai^^centiy labeled ceM^ 
uncronrbulcs ^ measuring u«r nrorphology (..g., bu«lled vs. non-bundled or 
normal). A high-tinoughp« version of a nncr„„*„,e re„,gani«ion assay would 

m^lveonlyanreasurem^^oftoulnncrombule polymer n«ssa«ercelh.lare.»a=ti^ 
Mti. a dc^en. -Hia. is. desUbUized mic^bules. hetog more easily extiaCed, would 

, -"' -»'»™'0Wmicn>.ub„le mass imninescence signal taunpertirted or drng^ 
«ab,l,zed luminescendy labeled micmtirbnlcs in anoto tieated cell population Tie 
lummesc^,, signal emanating ftom a domain co„«ng one or more celu will 
be pn^portional u, tire ti,nd mic^Urbule mass remaimng in U,e cells a«er foxin 
Ireatinent and detergent extractioa 

Tl» metitods for detecting, classi^ng. and identifying toxins can utiUze tite 
«me ^^reening methods described throughout the instant application, facluding but no. 
tatted to detecting change, in cytoplasm to nucl«,s translocation, nucleus or nucleolus 
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m^abo c processes „1> n.,my, «u substrate a«achn,c.t ceU c^k „e«s. and 

organellar structure and function. 

m all of these embodiments, the methods can be operated in both toxin-mimetic 
and toxm-inhibitory modes. 

Such ttctai^^ u. assKs ^ ^ 

.nchKbug. bu. ^ ^ ^vtauMena to ,« 

«mpl« .„d for texto uaii»d m ch«,ucal and biologica, weapons; ^ for d«e«n« 
ft. pr^enoe and characchsdca of toxins during e„vi,onn,«.ul ren,ediaUou. drug 
cliMca, applicaaous. and duHng nom«, dcveiopmen, ^ „snu6cWng 
process by vi,«.y any .ype of i„dus«y, including b« no, ItoUud to .griculuuo ,J 
PHKessing, sutoMobile. electe,„ic, ,ex«e. Medical device, and peta>leun, industtes 

W. have developed and characterized exanrples of lunjnescen. ceU-based 
reporters, disttbuted across d,e 3 sensor classes. Tie m«hods disclosed be™, can be 
"bhzed in conjuncSon wiu, con,puter datebases. and date management nuning 
remeval, and display n,«hods to e«ra« meaning^ pa,tems fion, ftc enormous date 
se, generated by each individual reporter or a combtoatorial of reporter in r«,ponse to 
.OX.C agenis. Such datebases a«l biofafonnaacs n,ed,ods include, bu. are uo, Umited 
o^tose disd^ed in U.S. Paten, Applicadon Nos. 09/437.976. med November .0 
.999; «V,45.770 fed ,uly „. ,99, and US. i-aten, AppUcadon Serial No. to be' 
assigned, filed February 19, 2000. (98,068-C) 

Any cell type can be used to cany out this aspect of the invention, including 
prokaryotes such as bacteria and archaebacteria. and eukaryotes. such as single celled 
fi.ng. for example, yeast), molds (for example. DiclyosteHum). and protozoa (for 
example, Euglena). Higher eulcaryotes, including, but not limited to. avian, amphibian, 
uisect, and mammalian cells can also be used. 

Examples n f Bin..i^ gnrj,f 
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Mitochondria) 
I Potential 
[Donnan Equiltbnum 

yeT 

I Heat Shock Protein 
(Hsp27.Hsp70) 
I Gyp-chimera 



Tubulin- 
cytoskeleton 
[P-tubulin-GFP 
chimera] 



I pp38 MAPK. stress 
I signaling 

[antibody and GFP- 

chimera] 

NF-kB- stress 
signaling 

[antibody and GFP- 
chimera] 



1 Positive 

LLCPK (pigcpithelia) Valinomycin 
Rat prtmafy hqjalocytes OOhM-ioohM) 
' FCCP 



HeLa 
3T3 



BHK 
HeLa 
LLCPK 



IkB 

[con^lement to NF- 

kB | 

Tetanus Toxin 
[Protease activity- 
based sensor] 



Anthrax LP 
[Protease activity- 
[based sensor] 



3T3 
LLCPK 



HeLa 

3T3 

BHK 

SNB19 

HepG2 

LLCPK 



In many cell types 



In many cell types 



In many cell types 



I (lOoM-tOOMM) 

Cadmium 
OOriM) 



TOF-a 
(lOQng^) 



PacUtaxel Staurosporine 

(10iiM-20mM) (InM-ljiM) 

Cuiacin-A 

(5 nM-lO^M) 

Nocadazole 

(7nM-I2(iM) 
' Colchicine 
I (5 nM-10HM) 

Vinblastine 

(SnM-lOp 

Anisomycin TNF-a 
Cadmium 

I (IOmM) 

I TNF-ct Anisomycin 

(I0Ong/jnl-038pgAnl) (lOnM-lOpM) 

^1 Cadmium 

(4ng^-.0ft5pg/ml) (MO pM) 



I Nisin 

(2-1000|i«Anl) 

Streptolysin 

(10 

Anisomycin 

' 000 - 



PenitremA 

00 MM) 

Valinomycin 
OmM) 



Examples of Detectors: This class of sensors pn,vides a first line signal that 
indicates the presence of a toxic agent This class of sensors provides detection of 
general cellular stress that requires resolution limited only to the domain over which the 
measurement is being made, and they are amenable to high cont«it sci«ms as weU 
Thus, either high throughput or high content screening modes may be used, including 
but not limited to translocation of heat shock factors from the cytoplasm to the nucleus 
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and change in ^H.^ „emh,ane potenW, tenceltato fr« ion 

d«e«io. (f„ example, Ca"; If), genend aeuboUc *a„s. ceU cycle tin^g evcnte 

and organellar structure and fimction. 



^- Mitochotutri nI Pntt^ tinf 

A key to maintenance of cellular homeostasis is a constant ATP energy chaiBe 
The cycling of ATP and its metabolites ADP. AMP. inorgamc phosphate, and solution- 
phase protons is continuously adjusted to meet the catabolic and anabolic needs of the 
cell. Mitochondria are primarily responsible for maintaining a constant energy charge 
throughout the entire cell. To produce ATP fiom its constituents, mitochondria must 
mamtam a constant membrane potential within the organelle itself TT^erefore 
measurement of this electrical potential with specific luminescent prebes prevides J 
sensitive and rapid readout of cellular stress. 

We have utilized a positively charged cyanine dye, JC-1 (Molecular Prebes 
Eugene. OR), which difiUses into the ceU and readily partitions into the mitochondrial' 
membrane, for measurement of mitochondrial potential. The photophysics of JC-1 are 
such that when the probe partitions into the mitochondrial membnne and it experiences 
an electrical potential >,40 mV. the probe aggregates and its spectral response is 
shifted to the red. At membrane potential values <140 mV. JC-1 is primarily 
monomeric and its spectral response is shifted toward the blue. Therefore, the ratio of 
two emission wavelengths (645 nm and 530 nm) of JC-1 partitioned into mitochondria 
provides a sensitive and continuous measure of mitochondrial membrane potential 

We have been making live cell measurements in a high throughput mode as the 
basis of a generalized indicator of toxic stress. TTie goal of our initial experiments was 
to determine the r^tio of J-aggregates of JC-1 dye to its monomeric form both before 
and after toxic stress. 
Procedore 

1. Cells were plated and cultured up to overnight 

The JC-1 monomer was measured optimaUv with a 48^ nm ^.-T I ^ «t • 
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Results 



3 



The mitochondrial potential within several types of Uving cells, and the effects 
of toxm. on the potential were measured using the fluorescence ratio Em 645 (590y 
Em 530 (485) (excitation wavelengths in parentheses). For example, we measured the 
effect of 10 m valinomycin on the mitochondrial potential within LLCPK cells (pig 
q)rthelia). Within seconds of treatment, the toxin induced a mor. rapid and higher 
magnttude decrease (an approximately 50% reduction) in mitochondrial potential than 
that found in untreated cells. Hcpatocytes were also determined to be sensitive to 
vahnomycin. and the changes in mitochondrial potential were nearly complete within 
seconds to minutes after addition of various concentrations of the toxin. 

These results are consistent with mitochondrial potential bemg a model 
mtracellular detector of cell stress. Because these measurements require no spatial 
resolution within individual cells, mitochondrial potential measurements can be made 
rapidly on an entire cell array (e.g. high throughput). TTris means, for example, that 
complex arrays of many cell types can be probed simultaneously and continuously as a 
generahzed toxic response. Such an indicator can provide a first line signal to indicate 
that a general toxic stress is presem in a sample. Further assays can then be conducted 
to more specifically identify the toxin using cells classifier or identifier type reporter 
molecules. 



^' ^gg/ Shark Prnt^i^v 

Most mammalian cells will respond to a variety of enviromnental stimuli with 
the mduction of a family of proteins called stress proteins. Anoxia, amino acid 
analogues, sulflrydryl-reacting reagents, transition metal ions, decouplers of oxidative 
Phosphorylation, viral infections, ethanol. antibiotics, ionophor^. non-steroidal 
antimflammatory drugs, thermal stress and metal chelators are all inducer, of cell stx««. 
protem synthesis. Amotion, or both. Upon induction, cell stress proteins play a role in 
foldmg and unfolding proteins, stabilizing proteins in abnomral configurations, and 
repairing DNA damage. 

There is evidence that at least four heat shock proteins translocate fiom the 
cytoplasm to the nucleus upon stress activation of the ceU. These proteins include the 
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heat shock piotems HSP27 and HSP70, the heat shock cognate HSC70, and the heat 
shock transcription factor HSFl. Therefore, measurement of cytoplasm to nuclear 
translocation of these proteins (and other stress proteins that translocate from the 
cytoplasm to the nucleus upon a cell stress) wiU provide a i^id readout of cellular 



stress. 



We have tested the response of an HSP27-GFP biosensor (SEQ ID 169-170) in 
two cell lines (BHK and HeLa) using a libra^r of heavy metal chemical compounds as 
biological toxin stimulants to stress the cells. Briefly, cells expressing the HSP27-GFP 
biosensor are plated into 96-well microplates, and aflowed to attach. The cells are then 
treated with a panel of cell stress-inducing compounds. Exclusively cytoplasmic 
localization of the fusion protein was found in unstimulated cells. 

Other similar heat shock protein biosensors (HSP-70, HSC70, and HSFl fused 
to GFP) can be used as detectors, and are shown in SEQ ID NO: 171-176. 



Examples of Classifiers: 

This class of sensors detects the presence of. and further classifies toxins by 
identifying the cellular pathways) perturbed by the toxin. As such, this suite of sensors 
can detect and/or classify toxins into broad categories, including but not limited to 
"toxins affecting signal transduction," "toxins affecting the cytoskeleton." and "toxins 
affecting protein syndesis". Either high throughput or high content screening modes 
may be used. Classifiers can comprise compounds including but not limited to tubuUn. 
microtubule-associated proteins, actin. actin-binding proteins including but not limited 
to vinculin. a-actinin, actin depolymerizing factor/cofilin. profilin. and myosin; NF-kB, 
IkB, GTP-binding protems including but not limited to rac, riio, and cdc42. and stress- 
activated protein kinases including but not limited to p38 mitogen-activated protein 



kinase 



^- Tubulin-rvtns^^tftn^ 

The cell cytoskeleton plays a major role in cellular fimctions and processes, 
such as endo- and exocytosis. vesicle transport, and mitosis. Cytoskeleton-affecting 
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chs^ Of ^ ^ ^ ^ ^ 

regulaw-y co™pon«= ^ ^ 

^easw^en, of s.™c«n, changes in tte cyu^^ ^ " 
M» . .hss Of cy^elc^n-fiecdng .oxta. Ms asaa, can b. conduct in a 
h.8h con-en. nK«K as dc^bcd preWously. or in a bi^ tooughpu, n,od. For high 
throughput as discussed previously. 

Such measurement „il, be vahable f„r idenUScata of Krins including, bu, 
no. lumted u, and-nucn,b*«le agent, agents >ba. gencaUy affic, celi cycle 
P^g-ession and ceU p^liferaho^ h^cellular signal hansducdon. and me«»,ic 
processes. 

For microtubule disruption assays, LLCPK cells stably transfected with a 
tubuhn-GFP biosensor plasmid were plated on 96 well cell culture dishes at SO^O"/. 
confluence and cultured overnight at 37 -Q 5% CO. A series of concentrations (10- 
500 nM) of 5 con^pounds (paclitaxel, curacin A. nocodazole. vinblastbe. and 
colchicne) in nonnal culture n,edia were freshly prepared fren. stock, and were added 
to ce^^ culnue dishes to replace the old culture ™edi. TT.e cells were then observed 
with the cell screening system described above, at a 12 hour time point 

Our data indicate that the tubuUn chimera localizes to and assembles into 
microtubules throughout the cell. THe microtubule arrays in cells expressing the 
chmiera respond as follows to a variety of anti-microtubule compounds: 

RraEonse 
Vmblastine Destabilization 
Nocodazole Destabilization 
Pacbtaxel Stabilization 
Colchicine Destabilization 
CuracmA Destabilization 

Similar data were obtained using cells expressing the tubulin biosensor that 
were pattem«l onto ceU arrays (such as those described in U.S. Patent Application 
S^. No 08/865,341 filed May 2, 199, incorporated by reference hel ir, Z 
entu^ty) and dosed as above. 
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NF-kB 



NF-kB is cytoplasmic at basal levels of stimulation, but upon insult translocates 
to the nucleus where it binds specific DNA response elements and activates 
transcription of a number of genes. Translocation occurs when IkB is degraded by the 
proteosome in response to specific phosphorylation and ubiquitination events HcB 
normally retains NF-kB in the cytoplasm via direct interaction with the protein, and 
masking of the NLS sequence of NF-kB. Therefore, although not the initial or defining 
event of the whole signal cascade. NF-kB translocation to die nucleus can serve as an 
indicator of cell stress. 

We have generated an NF-kB-GFP chimera for analysis in live ceUs. TTus was 
accomplished using standard polymerase chain reaction techniques using a 
characterized NF-kB p65 cDNA purehased from Invitrogen (Carlsbad, CA) fused to an 
EYFP PGR amplimer that was obtained fix,m Clontech Laboratories (Palo Alto, CA) 
The rewriting chimera is shown in SEQ ID NO:177-178. The two PCR product^ were 
Iigated into an eukaryotic expression vector designed to preduce the chimeric protein at 
high levels using the ubiquitous CMV promoter. 



NP-kB immunolocalization 

For further studies, we characterized endogenous NF-kB activation by 

immunolocalization in toxin treated cells. The NF-kB antibodies used in this study 

were purehased fiom Santa Cruz Biotechnology. Inc. (Santa Cruz, CA). and secondary 

antibodies are from Molecular Probes (Eugene. OR). 

For the 3T3 and SNB19 ceU types, we determined the effective concentrations 

that yield response levels of 50% of the maximum (EC50). expressed in units of mass 
per volume (ng/ml) and units of molarity. Based on molecular weights of 17 kD for 
both TNFa and IL-lo, the EC50 levels for these two compounds with 3T3 and SNB19 
cell types are given in units of molarity in Table 1. Our results demonstrated 
repreducibility of the relative responses fiom zero to maximum dose, but fiom sample 
to sample th^ have been occasional shifts in the baseline intensities of the response at 
zero concentration. 
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For these experiments, either 10 or 100 TOFa-treated 3T3 or SNB19 cellsAveU 
we,^ tested. On the basis of the standard deviations measured for these samples and 
by taking t-values for the student's t-test. we ^ve estimated the minimum dete^able 
doses for each case of cell type, compound, number of cells per weU. and for different 
choices of how many wells are sampled per condition. TT^e latter fector determines the 
number of degrees of freedom that are provided in the sample of data. Increasing the 
number of wells fix,m 4 to 16. and increasing the number of cells per well from 10 to 
100. miproves the minimum detectable doses considerably. For 3T3 cells, which show 
lower minimmn detectable doses than the SNB19 cells, and for the case of 1% false 
negative and 1% false positive rates, we estimate that 100 cells per well and a sarhpling 
of 12 or 16 wells are sufiBcient to detect a dose apprt>ximately equal to the EC50 value 
of 0.15 ng/ml. If the false positive rate is relaxed to 20%, a concentration of 
approximately half that value can be detected (0.83 ng/ml). One hmidred cells can 
conveniently be sampled overacell culture surface areaofless than 1 mm\ 



TMe 1. EC50 levels for TNFa and IL-la (based on molecular weights of 17 kD 



for 



Compound 


CellTvDc 


EC50 (10 -'^moles/liter) 








TNTFa 


3T3 


8.8 




SNB19 


5.9 








IL-la 


3T3 


0.24 




SNB19 


59 



3. Phospho-p38 Mitogen Activated Protein Kinase (Dv38hfAPIC\ 

MAPKs play a role in not only cell growth and division, but as mediators of 
cellular stress responses. One MAPK. p38. is activated by chemical stress inducer, 
such as hyper-osmolar sorbitol, hydrogen peroxide, arsenite. cadmium ions 
amsomycin. sodium salicylate, and LPS. Activation of p38 is also accompanied by its' 
translocation into the nucleus fiom the cytoplasm. 
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MAPK p38 lies in a pathway that is a cascade of kinases. Thus, p38 is a 
substrate of one or more kinases, and it acts to phosphorylate one or mot. substrates in 
time and space within the living cell 

The assay we present here measures, as one of its paramet^.. p38 activation 
5 usmg mmiunolocalization of the phosphorylated form of p38 in toxin-treated cells The 
assay was developed to be flexible enough to include the simultaneous measurement of 
other parameter, within the same individual cells. Because the signal transduction 
pathway mediated by the transcription factor NF-kB is also known to be involved in the 
cell stress response, we included the activation of NF-kB as a second parameter in the 
10 same assay. 

Our experiments demonstrate an immunofluorescence ^roach can be used to 
measure p38 MAPK activation either alone or in combination with NF-kB activation in 
the same cells. Multiple cell types, model toxins, and antibodies were tested, and 
significant stimulation of both pathways was measured in a high-content mode The 
15 phospho-p38 antibodies used in this study were purchased from Sigma Chemical 
Company (St. Louis. MO). We report that at least two cell stress signaling pathways 
can not only be measured simultaneously, but are differentially responsive to classes of 
model toxins. Figure 36 shows the differential response of the p38 MAPK and NF-kB 
pathways across three model toxins and two different cell tj^es. Note that when added 
20 alone, three of the model toxins (ILla. TNFa and Anisomycin) can be differentiated 
by the two assays as activators of specific pathways. 

IkB chimftra 

ncB degradation is the key event leading to nuclear translocation of NF-kB and 
25 activation of the NFkB-mediated stress response. We have chosen this sensor to 
complement the NF-kB sensor as a classijier in a high-throughput mode: the 
measurement of loss of signal due to degradation of the IkB-GFP fusion protein 
squires no spatial resolution within individual cells, and as such we envision IkB 
degradation measurements being made rapidly on an entire ceU substrate. 
30 TTus biosensor is based on fusion of the first 60 amino acids of DcB to the 

Fred25 variant of GFP. SEQ m 179-180 This region of DcB contains all the regulatory 
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sc,uenc«. tocluitag pbo»phao-Wo. site, and ^^^^ ^ „^ „ ^„ 
^som. degmbaon upon U,e bk>«„s„r. Kn^g m., «i^udo„ of aa, pahway 
to. wo^ s^cally lead B NFkB «a„„„cado„ «„te ta d.^„n „f ads biosenj 
M«uto™g d„, fluo^ee i*„si,y of cells e^g ^ 
degradation process. 



Examples of Identifiers: 

In our toxin identification strategy, the first two levels of characterization ensure 

a rapid readout of t.xi„ class without sacrificing the abifity to detect many new mut^^ 
toxms or dissect several complex mixtures of known toxins. The third level of 
baosensors are identifier, which can identify a specific toxin or group of toxins. In one 
embodiment, an identifier comprises a protease biosensor that r^onds to the activity 
of a specific toxin. Other identifier are produced with reporters/biosensors specific to 
the. activities. n.ese include, but are not limited to post-translational modifications 
such as phosphorylation or ADP-ribo^lation, translocation between cellular organelles 
or compartments, effects on specific oi^anelles or ceUular components (for example 
membrane permeabilization. cytoskeleton rearrangement, etc.) 

APP-ribo.sylatinptorin. -These toxins include Pseudomonas toxin A. diptheria 
toxm. botulinum toxin, pertussis toxin, and cholera toxin. For example. C. botulinum 
C2 toxin induces the ADP-ribosylation of Argl 77 in the cytoskeletal protein actin. thus 
altenng :ts assembly properties. Besides the construction of a classifier assay to 
measure actin-cytoskeleton regulation, an identifier assay can be constructed to detect 
the specific actin ADP-ribosylation. Because the ADP-ribosylation induces a 
confonnational change that no longer permits the modified actin to polymerize this 
conformational change can be detected intracellularly in several possible ways 'usin^ 
lummescent reagents. For example, actin can be luminescentiy labeled using a 
fluorescent reagent with an appropriate excited state lifetime that allows for the 
measurement of the rotational difiusion of the intracellular actin using steady state 
fluorescence anisotropy. toxin-modified actin wiU no longer be able to 

assemble into rigid filaments and wiU therefore produce only luminescent signals with 
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relatively low anisotropy. which can be readUy measured with an imaging system In 
another embodiment, actin can be labeled with a polarity-sensitive fluorescent reagent 
that reports changes in actin-conformation through specti^ shifts of the attached 
reagent. ITiat is. toxm-treatment wiU induce a conformational change in intracellular 
actin such that a ratio of two fluorescence wavelengths wiU provide a measure of actin 
ADP-ribosylation. 

Cytotoxic phospholip; ,^^. - Several gram-positive bacterial species pn,duce 
cytotoxic phospholipases. For example. Clostridium perfiingens produces a 
phosphoUpase C specific for the cleavage of phosphoinositides. Th^ 
Phosphoinositides (e.g.. inositol 1.4.5-trisphosphate) induce the release of calcium ions 
from mtraceUuIar organelles. An assay that can be conducted as either high-content or 
high-throughput can be constructed to measure the release of calcium ions using 
fluorescent reagents that have altered spectral prx,perties when complexed with the 
metal .on. Therefore, a direct consequence of the action of a phospholipase C based 
toxm can be measured as a change in cellular calcium ion concentration. 

Exfoliative toxins - TTiese toxins are produced by several Staphylococcal 
species and can consist of several serotypes. A specific identifier for these toxins can be 
constructed by measuring the morphological changes in their target organelle the 
desmosome. which occur at the junctions between cells. The exfoliative toxins are 
known to change the morphology of the desmosomes into two smaller components 
called hemidesmosomes. hi the high-content assay for exfoliative toxins, epitheUal cells 
whose desmosomes are luminescently labeled are subjected to image analysis An 
method that detects the morphological change between desmosomes and 
hemidesmosomes is used to quantify the activity of the toxins on the cells. 

Most of these identifiers can be used in high throughput assays requiring no 
spatial resolution, as well as in high content assays. 

Several biological threat agents act as specific proteases, and thus we have 
focused on the development of fluorescent protein biosensors that report the proteolytic 
cleavage of specific amino acid sequences found within the target proteins. 

A number of such protease biosensors (including FRET biosensors) ai^ 
disclosed above, such as the caspase biosensors, anthrax, tetanus. Botulinum, and the 
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zinc metalloproteases. FRET is a powerful technique in that small changes in p„,tein 
conformation, many of which are associated with toxin activity, can not only be 
measured with high precision in time and space within living cells, but can be measured 
m a high-througlq>ut mode, as discussed above. 

As described above, one of skill in the art will recognize that the protease 
biosensors of this aspect of the invention can be adapted to report the activity of any 
protease, by a substitution of the appropriate protease r^gnition site in any of the 
constructs (see Figure 29B). As disclosed above, these biosensors can be used m high- 
content or high throughput screens to detect in vivo activation of enzymatic activity by 
toxms. and to identify specific activity based on cleavage of a known recognition motif 
These biosensors can be used in both live cell and fixed end-point assays, and can be 
combmed with additional measurements to provide a multi-parameter assay 



Anthrax LF 



Anthrax is a well-known agent of biological warfare and is an excellent target 
for development of a biosensor in the identifier class. Uthal factor (LF) is one of the 
protem components that confer toxicity to anthrax, and recently two of its targets within 
cells were identified. LF is a metaUoprotease that specifically cleaves Mekl and Mek2 
proteins, kinases that are part of the MAP-kmase signaling pathway. Construction of 
lethal factor protease biosensors are described above. (SEQ ID NO:7-8; 9-10) Green 
fluorescent protein (GFP) is firsed in-fiame at the amino terminus of either Mekl or 
Mek2 (or both), resulting in a chimeric protein that is retained in the cytoplasm due to 
the presence of a nuclear export sequence (NES) present in both of the target 
molecules. Upon cleavage by active lethal fector, GFP is released fiom the chimera and 
IS free to difiuse into the nucleus. Wore, measuring the accumulation of GFP in the 
nucleus provides a direct measure of LF activity on its natural target, the living ceU. 

While a prefermi form of the invention has been shown in the drawings and 
descnbed, since variations in the preferred form will be apparent to those skilled in the 
art. the invention should not be construed as limited to the specific fom, sho^ and 
described, but instead is as set forth in the claims. 
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CLAIMS 

We claim: 

1. An automated method for cell based toxin characterization comprising 

-providing an array of locations containing cells to be treated with a test 
substance, wherein the ceUs possess at least a fir^ luminescent reporter molecule 
compnsmg a detector and a second luminescent reporter molecule selected fiom the 
group consisting of a classifier or an identifien 

-contacting the cells with the test substance either before or after possession of 
the first and second luminescent reporter molecules by the ceUs; wherein the 
locahzation. distribution, structure, or activity of the first and second lumirrescent 
reporter molecule is modified when the cell is contacted with the toxin, 

-imaging or scamiing multiple cells in each of the locations containing multiple 
cells to obtain luminescent signals from the detector; 

-converting the luminescent signals from the detector into digital data; 
-utilizing the digital data from the detector to automatically measure the 
locahzation. distribution, or activity of the detector on or in the cell, wherein a change 
m the localization, distribution, structure or activity of the detector mdicates the 
presence of a toxin in the test substance; 

-selectively imaging or scamung the locations containing cells that were 
contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
from the second reporter molecule; 

-converting the luminescent signals fiom the second luminescent reporter 
molecule into digital data; 

-utilizing the digital data fiom the second luminescent reporter molecule to 
automaticaUy measure the localization, distribution, or activity of the classifier or 
.dentxfier on or in the cell, wherein a change in the localization, distribution, structure 
or activity of the classifier identifies a cell pathway that is perturbed by the toxin 
present m the test substance, or wherein a change in the localization, distribution, 
structure or activity of the identifier identifies the specific toxin or group of toxins that 
are present in the test substance. 
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2. The method of claim 1 wherein the second luminescent r^orter molecule is a 

class.fier.and the digital dataderivedfiom the classifier is used to sel^^ 
Identifier for identification of the specific toxin or group of toxins. 

5 3. Anautomatedmethodforcenbasedtoxincharacterizationcomprising 

-providing an array of locations containing cells to be treated with a test 
substance, wherein the cells possess at least a first luminescent reporter molecule 
comprising a detector, a second luminescent reporter molecule comprising a classifier 
and a third luminescent reporter molecule comprising an identifier. 

' ^'^^g^-^l'^-ith the test substance either before or after possession of 

the fir^ second, and third luminescent reporter molecules by the cells; wherein the 

localization, distribution, structure, or activity ofthefii^ second, and 
reporter molecules is modified when the cell is contacted with the toxin. 

-imaging or scamiing multiple cells in each of the locations confining multiple 
cells to obtain luminescent signals fiom the detector. 

-converting die luminescent signals fix,m the detector into digital data; 
-utilizing the digital data fiom the detector to automatically measure the 
localization, distribution, or activity of the detector on or in the cell, wherein a change 
in the localization, distribution, stmctur. or activity of the detector indicates the 
presence of a toxin in the test substance; 

-sclectivdy imajing „, ^c^^ ^ ^^^^^ ^ ^ 

combed «s. .^pte tadieaud „ . .oxto i. i, ^ ^ „„^, 
trom the classifier; 

-converting the luminescent signals torn the classifier into digital data; 

-utilizing the digital data fiom the cla^er to automatically measure the 
locahzation.di^bution. or activity ofthe classifier on or in the cell, wh^^ 
m the localization,, distribution, structure or activity of the classifier identifies a ceU 
pathway that is perturbed by the toxin pr^ent m the test substance; 

-selectively imaging or scanning the locations contairiing cells that were 
contacted with test sample indicated to have a toxin in it to obtain luminescent signals 
from the identifier; 

-converting the luminescent signals fiom the identifier into digital data; and 

128 



wo 00/50872 

PCT/USOO/04794 

«P«<ic toxin »r g™,, of toxte thai is p«e„, in fte substmc 
4. ■^■«ft«'ofcWm3,rt,e,ei„a„jigia,4^j^,^g^^^l^^^ 
" """^"^ *" the specific U»n or gronp 



10 5. 



IS 



fto„ ac gro„ cona-aing of kea, *ock p,o.eins and con,„„n^ ^ ^ 
organellarstnicture and function. 

"Cfccted *o„ fto „^ „f ndc^lubnle-associaM p„«ins. actfn. 

acnn-hnding proteins, NF-,cB, WB. and stts^y^ucd kinases. 



20 7 



The meO«,d of any one of claim M wherein d,e cell pad,„ay is selected from 
^group eonsisdng of ee,, stress pathways. eeU ntetaholic padtways. cell signaling 
Pathways.cellg,„Mhpalbways.andcelldivisionpathways. 

8. jn«-»«k«'°f'=>atal,whe,eind,esecondl™nineseentrep„,,ern»le^^ 
- ■--»---'^'^««eriden.ifiesa.oxinorg,n.,of.„^sete«ed,K,n..h.gIZ 



9^ ^ ■°«'°<' "f-i' claim 3-7, Wherein U,e identifier ide«ifles,te,ti„ or 
'OMa.cytotox.cphospholipascs.andcxfcBative.oxins. 
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10. 



Ite method of any of clauns 1-9 wherein the change in the localization, 
chs^bufon. structure or activity of the first, second, or third luminescent reporter 
molecules is selected from the group consisting of cytoplasm to nucleus translocation, 
nucleus or nucleolus to cytoplasm translocation, receptor internalization, mitochondrial 
► membrane potential, loss of signal, the spectral response of the reporter molecule 
phosphorylation, intracellular free ion concentration, cell size. ceU ^, cytoskeleton 
orgamzation. metaboUc processes. ceU motility. ceU substrate attachment. ceU cycle 
events, and oi;ganellar structure and function. 

11. me method of any one of claims 1-10. wherein the imaging or scamung 
multiple cells in each of the locations containing multiple cells to obtain luminescent 
signals from the detector is carried out in a high throughput mode. 

12 Tbe method of any one of claims MO. wherein the imaging or scamung 
multiple cells in each of the locations containing multiple cells to obtain luminescent 
signals from the detector is carried out in a high content mode. 

13. n.e method of claim 1-10 wherein the selective imagmg or scamung of the 
locations containing cells that were contacted with test sample indicated to have a toxin 
m it to obtain luminescent signals from the second or third reporter molecule is carried 

out in a high throughput mode. 

14. The method of claim 1-10 wherein the selective imaging or scamiing of the 
locations containing cells that were contacted with test sample indicated to have a toxin 
m It to obtain luminescent signals from the second or third reporter molecule is carried 

out in a high content mode. 

15. THe method of any one of claims 1-14 forther comprising providing a digital 
storage media for data storage and archiving. 

16. The method of claim 15 further comprising a means for automated control, 
acquisition, processing and display of results. 
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and . con^ „^ ft, ^ 

oigital camera^ 

"> 18. ABtforceUbastdtoxiBdetaaoncomprisiiig; 

"« <»■' "Pon- "«"«=-l=. wherein U^c,^ disrtb«i„„, 

.5 »v *L '"'^^'''■^"««>'«="'«'° '"you. to „«aod of 



20 



25 



0 



20. An»..™na«d„ea,„dforcel.based»xmchanicWzationco»prtsmg 

•■»<^f^>fi««"yofk«aSonsco„tei™g«a.,„,e^^^,^ 
wh^ eeus posse, a . fi^ 
™g a repo^ „„,„^e se>e«e. fio. ^up co„sis«„g „f <^ ^ 

-«"-«"8''"=Il»™**c,es.s„toa^^^„„^ 

When the ceU is contacted with the toxin, 

cells toot^T""^ 

cells to obtain luminescent signals fiom the detector, 

-converting the luminescent signals fiom the detector into digital data; 
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». «.e locals™. dis«bu«o^ or a«ivi,y of fl. ^ ^ 

presence of a toxin in the test substance, 

amy of lc«*onscoMainin8 cells to be treated ™.h .be ^ 
substance wbetein the cells possess a leas, a second taU,^, ^„ .^^^ 

-"Pnsn^ a tepcrtern^lecnle selected ftom tbe g,o„p consisting of classifies and 

lt"!r ^"""^ " CCDs can cornprtse 

ettber the sameoradifreremcell type as the fin^atrayoflocations containing cells- 

■"°'»«"8*=«-nd«.yoflocations containing cells ™,h the t« 
«*er before or alter possessi™ of the sec«, l^inescent r^er molecule by the 
cells; Wherein the l„cali.ho„. distribntion. s^uctnre. or acd«ty of the slnd 
lummescent reporter molecnle is nK,dif ed when the cell is contacted mth the toxhv 

-umizh^ the digital data ftom to second l„n»n.se«« r^er molecale to 
^.tonatically mcasuto the localization, distdbution. or acdvity of the classifier or 
■denttfier „„ or in the cell whe^n . change in .be taxation, distHbrton, a,,,^ 
or achvio, of the classifier identifies a cell pathway that is pertnrbed by the toxm 
present m the test substance, or wherein a change in the localization, distribation. 

stature or acfivityoftheid^^ifieridentifies the specific toxin or gronpoftoxins that 

are present in the test substance. 
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SIGNAL SEQUENCES 




Myc epitope 



AA Beg; PPEPET 

5 ' GCAGAAGAACftAAAATTAATAAGCGAAGA 
AGACTTA 

AA Seq: AHEQKLISEBDIi 



40 
41 



42 



MacArthur and Waller 
1984. J Virol. 52:483-01, 



Gosney, etal.,1990. 
Anticancer Res. 10:623-8. 



EYFP: SEQ ro NO: 43 (Nndefc .dd); SEQ m NO:44 (Amino .dd) 
MVSK QBBL PTGV Vdtt . 

^=c^«cLL ^scLl^c 

«^0^ S^cL^C JL^ToJccL LLjcJcC 

JrcLL^Lc LSu=Sxirc SccSncLL: 
c«L^ ?rcL?c4c J,,,L:L4c 

™^»^ LL^LL, LSccSccL, LL4L 
^-i^Tc LJkLLc LLJ«L I^LoL 

'tLBY NYNS H NVY Tif»« 

A^K^T^ «CT»c«™c «c«c™:.„ ItcJ^^ccL LLLX^ 
cocTc.^ «c™«.^ «,co«:LJtc LLLLc 
^"c^AcL, U^JccL Itc^cSJoc c'cc^Jh, LL^L 
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YLSy QSAL SKDp WPitd « 
TACCTOAGCTAC CAGTCCGCCCTG AGCAAAflACCCC AACGAoLgSgC LTL^Tt^L 

c««n«sAG?rc LL4cScc L^xdcT^rc ggcJi4cJag ^acL 



EGPP: SEQIDNO:4S (Nucleic acid); SEQ ID NO:46 (Aminoadd) 

M V S K G E E L p t G V V P T r « » , 
A^XHAOou^ GGCOACCAGC™ TTCACCOOGGTG LL^TcL Li^C 

^TaL L?.cL4c LcLJccL LicLL ^4cIccJaC 

^cL^cc ^rGL^rcixc ^cLcLL LL^cL ccc^o^ccLc 
^cLlc dec LLcLL: Jac?cc "a^r^ 

^?^c?rc ?tcLg?ccScc JtgJccL^gc LcLLL S.cLcJtcJtc 
ttcLLSac L^^cLL L?GcJcoL LL^tcLg icLcL^xo 
^cLJtc LLLL LLircL LL^cL ATcUx^^ 

KLBYNyusHHVY'TMKn^ 

^TcLL LcJtcLItc SocLL^Jtc LLLL: 
»cn>cT«oo c«a«c»ccccc ««o«»coac ccc^,^ LLL:^ 
^«^«4ic L^ci^ LcLL^c LLL^c ^tL^toJtc 
^'.^r^ LLcSccSc^ LJtJctJtc 



EBPP 



: SEQIDNO:47 (Nucleic add); SEQ ID I*0:48 (Amino add) 



^nJ' ^ ^ O.EEL PTOV VPTt u», 
AT^GXtSAGCAAG GGCGAGGAGCTG rrCACCGGGGxO GTGCcJttL ^TtivG^ 



D 
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^^^^cLL Lc^^ 

^^«ic ^wLJtJtc ?«iJcc^ 

I-VTT LTHQ V QCP Rov« « 
C™™««CC ™«CCC»CX=0C CTC^^CXTC LcLJ^Lc LcLJ^L 

^"^SvcLJt: ?„JL?c4c ItoLLLc LLLcL L:Lc:iT4c 
r^<i>^ ^iLL LcSjcS,:^ LLJrcL. LS>cLL 
^S«:^TC LJroLic I™L„i,i>„ LLi.^ LircLL 
•^ocTo^ «.:r,c»c^ c«woTcT« JtcI:.„=^ 
^-LcLL LcJJ^oirc LLU=c 

OBVO OBTP lODO P»,, . 

=«»™ca« c»a«c«:c<:oc »™ccm««e ^ocLJra™ LS„^;vcL 

L^lcc^ LiJLc^ 
^-^«U,c LLc^cc LLL^ S^,^ 



BCW. SEQIDNO:* <N.deh.cid),SEOIDNoaO(A«dn.«*i) 

;«icLJ,« L4ccJt,^,c 

LcLcL?^ L^z^c^c LicLL L4cLJ« 
^-"^^ 

'^l^ 'JLLL: IJLLL: «cic?»cL LJccJ^ 
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P K D D CMYK TRAE VKPE QDTL 
TTCAAGGAOGAC GGCAACTACAAG ACCCGCGiCCGAG GTQAAtrrrCGAG GQCGACACCCTO 

VNRI ELK O IDPK BDQN I LOH 
GTGAACCGCATC GAGCTOAAGGGC ATCGACTTCAAG GAGGACGGCAAC ATCCTGGG^ 

KLBY NYIS HHVy ITAD K Oirw 
MGCTGGAGTAC AACTACATCAGC CACAACGTCTAT ATCACCGCCGAC AAcLsL^ 
G IKA HF KI RHNI BDOS VO 
GGCATCAAGGCC AACTTGAAGATC CGCCACAACATC OAGGACGGCAGC GTGCAG^C 

GACCACTACCAQ CAGAACACCCCC ATCGGCX3ACGCC CCCGTOCTOCTC CCcLcLcL: 

YLST QSAL SKDP WEKR D bmw 
TACCTGAGCACC CAQTCCGCCCXC AGCAAAGACCCC AACXIAGAAGCGC GAT^C^TcL 

LLBP VT AA GITI, OMDB LYK 
CTGCTCOAGTTC GTCACTGCCGCC GGORTCACTCTC GGCATCC3ACQAG CTGTAcLs 



Pred25: SEQIDNO:51 (Nndeic acid); SEQ ID NO:52 (Amino acid) 

MAS K GBBL PTGV VPli. VBtn 
ATGGCXftGCAAA GGAQAAGAACTC TTCACTGGAGTT GTCCCAATTCTT GTTGAATTaLt 

O DVN GHKF S VSG EGBG DAT Y 
GGTGATGTTAAC GGCCACAAGTTC TCTGTCAGTGQA GAGGGTCAAGGT GATGCAACATAC 

GKLT IiKPI cttg jclpv p 
aSAAAACTTACe CTGAAGTTCATC TGCACTACTGGC AAACTCCCTCTT CCaL^CaLa 

LVTT LCYG VQCP SRYP DHMK 
CTAGTCACTACr CTGTGCTATCGT GTTCAATCCTTT TCAAQATACCCG GATCATATCWIA 

RHDP PKS A MPEG YVQE RTIP 
CQGCATGACTTT TTCAAGAGTGCC ATGCCCGAAGGT TATQTACAGGAA AGGACCATCTTC 

PKDD GHYK TRAE VKPE GDTT 
TTCAAAGATGAC GGCAACTACAAG ACACQTGCTGAA GTCAAGTTTCAA GGTCATACCCIT 

V NRI ELKO IDPK EDGH ILQH 
GTTAATAGAATC GAGTTAAAAGQT ATTOACTTCAAQ GAAGATGQCAAC ATTCXGGGACAC 

KLEY NYHS HHV Y IHAD KOKM 
AAATTGGAATAC AACTATAACTCA CACAATGTATAC ATCATGGCAGAC AAACAAAAC^T 

OIKV NFKT RHN I EDQ S VOl.» 
GGAATCAAAGTG AACTTCAAGACC CGCCACAACATT GAAGATOGAAGC GTTCAACTAGCA 

DHYQ OMTP IGDG PVLL PDHH 
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GACCATTATCAA CAAAATACTCCA ATTGOOGATCGC CCTGTCCTTTTA CCAGACSUUXaT 

YLST QSAL SKDP HBKR DHUfv 
lACCTOTCCACA CAATCTCCCCTT TCQAAAGATCCC AACGAAAAQAQA GACCAC^TGOTC 

LLBP VTA A G I TH O MDE LYii* 
CTTCTTQAQTTT GTAACAGCTGCT GGGATTACACAT GGCATGQAIQAA CTGTACAACTM 
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PROTEASE RECOGNITION SITES 



Substrate 

Rttcognmons 

Sequencea 



Caspasc-2 



Caspase3.7 



ProCaspaseJ 



Source 



peptide library 



peptide Jtbmy 



peptide librwy 



PARP 



Reoogmtion Site 



5TrGG.TrA)GA.ACATOACAA" 
Seq:{W.UEHIV 



STXXmTAAAGAC 
AA Scq. WFKEV 



5*GACXjAACACGAC 
AA Seen PEHD/ 



Caspai5C'3 



Pn>Caspase-44 
Caspaseb 



proCflspaa-7 



CaspaseS 



peptide library 



Lamin A, 
peptide library 



Caspise^ 



peptide library 
^ptideKbrary 



prDCaspase-S 



CaspBseQ 



Adcoovirus 
cndopeptidase 



b-Sccretase 



CathepsinD 



Matrix 
Metatlopmt«.t^ 



Caspase-8 



5'GACX.AACTTCAC 
AASeq:DEVIV 



5'ATAGAAACAGAC 
AAScqMETiy 



5*TCGGTAAGACAC 
AAScq: WVRiy 

5'GTAGAAATAGAC 
AAScq: VEUV 

5'GTAGAACACGAC 
AA Scq: VEHD/ 

3'ACAOAAGTAGAC 
AASeQ:TEVD/ 



5*ATACAACCAaAC 
AA Scq: ipAtV 

5*GTAGAAACAGAC 



AAScq: VEnV. 



5TTAGAAACAGAC 
AASeq:LETIV 



P^rtidc libraiy 
Ca$pase-9 



Amyloid 
precursor 
pfotetn 



Ittagaacaccac" 

AAScqrLEHIV 

OCCQAACCdOAC 
PEPD 



5*AGCCAAAATTAC 
AASeq:SQNy/ 

S'CCAATAGTACAA 
AASeq:PrVQ/_ 



i'AUGTTTCGACGA 
AAScq: MR5G/ 

5'GCAAAAAAAAGA 
AA Scq: AKKR/ 



5*0TAAAAAUG 
AAScq. VKM/ 

3'OArCCAGAATTr 
DAEF/ 



5'AAACXACCATrATTC 
AASeq:KPALF 

STTCACATTA 
AASeQ:FRiy 

yUiACCArTAGGACCA 
AASeq:GPLCP 



SEQID 
NO 



55 
56 



Reference 



Thonibeny el aL, 1 997, J. Biol. 
Chem. 272:17907 



57 
58 



61 
62 



65 
66 
67 
68 



71 
72 
73 
74 



TlJombeny et at, 1997, 1 Biol. 
Cheiw. 272:17007 



"njonjbcny ct al.. 1 997, J. Biol. 
Chem. 272:17907 



Sendee, et al., 1 997. Biochem 
Mol Biol InL 43:755-61; 
Thombeny ct «I, 1 997, J. BioL 
Chem. 272:17907 



Tewari,M., eial., 
81:801-9. 



1995. Cell 



Thombeny, N.A. et ol., 1997; 
i-Biol. Chem. 272. 17907-1791 1 



Nakajima and Sado. 1993. 
BiochimBiopbyt Acta. 1171:3] t- 
4; Thomberrycta)., 1997, J. BioL 
Chem, 272:17907 



Fcmandcs-Alnonri, cc al., 1994. J 
Biol Chem. 269J076I-4. 



■"wnibeny, NA. « 1997,~ 
ijaiol. Chem. 272. 17907-1791 1 



75 
76 



77 
78 

79" 
80 



Muzio. M., et al.. 1996, Cell, 
85:817-27; Femandca-Alnemri, et 
al.. 1996. Proc Natl Acad Sci U S 
A. 93:7464-9;ThonAeny et al., 
1997, J. Biol. Chem. 27X17907 



Mtoio, M.. et al., 1996. Cdl. 
85:817-27; Femandes-Atocmri. et 
a)., 1996. Proc Natl Acad Sci U S 
A. 93:7464^;ThombcrTy ct al., 
1997, J. Biol. Chem. 272:17907 



85 
86 

87 
88 



89 
90 

91 
92 



93 
94 

95 

96 
97 
98 



Thombory, N«A. et al, 1997, 
J.Biol. Chem. 272, 17907-1791 1 

Thombeny. N.A. et al^ 1997, 
iSiol. Chem. 272. 17907-1701 1 



Matayoshi, ct al., 1990. Science. 
247.-954^ 



Weber and Tihanyi. 1994. 
Methods Etuymol. 244:595-604. 
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Protein Pivcunor in 
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AUhcimcrt Disease, ed. C.L. 
Maflefsetal.,pp. 190-198. 
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1 Anthrax protease 
1 Anthrax protease 


1 pcpuoc iiDrxry 
MEICI " 


S'ATAGAACXJAGAC 
_ AAScqrIEPIV 

CCCCATCCAGCTGAACCC 
AASea: MPKKKPTPIOIJN 


99 

100 

101 

102 


Thomberry el al . . 1 997» J. BioT i 

_ Chcm. 272:17907 

VitaJectaL,(1998) Biochcm 

Bio{thys Res Cbmmun 248 O). 
706-711 




MEIC2 


5'ATGCTGGrrrY:r:Ar:r A tnr^^ 

GTtKrrCCCCGCCCTCACCA'rcA 
ACCC 

AA Bar. MLARRKPVLPAI TIM 


103 
104 


Viialcelal.,(1998) Biochem j 

Bwphys Res Commun 248 Ol. 
706-711 




iUCCTTXX:AGTTTOAAACA 
AASeq:ASQFI?r 


105 
106 


McMahon et al. Nature 364046- 1 
349; Martin el aL, J. Cell Biol. In | 


1 BotuIbNim 
1 nctimtoxinA 
1 Botulinum 


1 synxpuwrcvin/ 
VAMP3 

SNAP-25 


i iiCl lUtAAI ITOAAACG 

AASeQ:ASQFFT 

5UCCAACCAACCTGCAACA 

AASea:ANq/RAT 


107 

108 
109 

no 


press 1 
Schiavoetal., (1992) Natute 1 
359,832-5 

Zhao, et at Gene 145 313- 
314 0 994) 


1 neurptoxinB 
1 Bouilmum 
1 neurotoxfnC 
1 BotultnuTD 


1 VAMP 
1 Syntixin 


A'iiC riCTCAA i ri OAAACG 
AA Sep: ASQ/FET 

3'ACGAAAAAAGCTGTGAAA 
AASco:TKK/AVIC 


ill 

112 
113 
114 


Martin ctaI.,j.Uukoc Biol. 65 1 


1 neurotoxin D 


VAMP 


^UACCAGAAGCIimCTGAG 
AA Seq: DOK/LSP 


115 
116 


(31,397-406(1999) 


1 Botulinum 
Incurotoxin E 

Botulimm) 

neurotoxin F 


jsKAP-25 
j VAMP 


S'ATCGACAGGATCATDGAG 
AASeq:iDR/IMF 

5AGAOACCAOAAGCTCTCf 

AASeq: RDCyiCLS 


117 
llg 

TW 
120 




1 Botulinum 
I neurotoxin G 


j VAMP 


5'ACGAGCGCACCCAAGTTG " " 
AASeo:TSA/AICL 


121 
122 
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PRODUCryREACTANT TARGET SEQUENCES 



Target 



! CytoplasnVcytos 
kdeton 



Inner suif act of 
tnembnute 



Nucleus 
Nudeotus 



MitochondrU 



Nuclear EnveJopc 



Target Source 
Aimextnll 



Target domain (Prodnct or Reactant) I SEQ W 

' I 

SVU-GTCTACTOTCCACGAAAKXTarcCAAG t^i" 

CTCAGCTTCCAGGGTCTTCAmTACACCCCC 
AAGTGCC3* 



Reference 

t997,Mol.BtoL 
CdI8:293fc 



famesybtton 



(Amino acid seq:M STVHEILCICLSL I 124 
EOVHSTPPSA) ' 



NFkfipSO 

nglT 



cytochiDmec 



5»AUG00Aa lALAI IAAGCCCAGAAGACAA. 
ACCAGCAGTAGAAAGAAOCAAAAUGATAGA 

?i2iS^iiF^^^^GAGAAGACC3GAGAAAA 
ACCTGCTAGA3* 

I (AAseq:M OCTLSAEDKAAVER 
(SKMIDRNLREDOEKAAR 

S'AGAAGGAAACXJACAAAAG ' 
(AAieq:RRKRQK) 

TCnCItXUCXn-ATTTXnTTCXX!AC^^ 

I "t'RTYLKSCRRMK 
iRSOFEM SRPIPSH l-n 



125 



126 

127 
128 



130 



£^^^TccrGA^CGCTG5feci^^ 

GGCTTOACACGCTCXX3CCCGC 
TGCCGCXXXXXAAGATCC^VTrcCTTC 



Golgi 



ODV"E66A 
ODV-E25 



(AASeqrM SVLTPLLLRGLTGS 
A_RR L PVPRALIHj!!^ 

n 



132 



f 5'AUGAGCATTG 

rm 



T^. TAAT AATTGTrATTTGGA | 133" 

TTAATATGTrrrTTATAnTAAGCAACA 



Cairctkulm 



GCAAAGATCCCAGAGTACCAGTTGAATTAAU 



(AASeq:M SIVLIIVJVVIFLICF 
L YLSNS K_D P RVP V E 



AGCGCCGCQATCCCAGGCXXXnCCCTAS^ 

S^2S3H5P^cocTCGm 

CTTOTCGGACTCTCCACACCGCTTGC^ 

5E5^^^^^^^°^^CGCCGCCATCGGG^ 

CTCCGGGGACCTCCGOACCGGAGGOGCC 

(AASeq:M RLREPLLSGSAAMP 
G A SLQRACRLLVAVCALHLOVTL 
VYYLAGRDLSR.LPQLVGVSTPLOG 
GSNSAAAIfiOSSGELRTnnV/^ 
^;?^^^^UAAGAOT^ 

i^^JJIi^^'^^^^^^A^^^AAUGGCAA 

CCAGOAAlWTTAGCmAmc^ 
^^CTTnTCAGTACAAAAAAA 

I^^^^PFPLAtPOMLALLGWWW 



134 



Endopbsinic 
reticulum 



Nudear Export 



O^KAPI 



MEK1 



135 



136 



FemiccioG,etil, 
J. Biol. Chem 274, , 
5843*5850, 1999 



Hcnltel.Tetal., 
Cell 68. 1121- 
1 133. 199? 
LW£i.eiaJ..1998. 
Biochem Biophyt 
RetCommun. 
252.'97-ia2. 



Rimito,etaL, 
I989.J Bid Chem. 
264:10595-600. 



Hong. Thecal. 
PNAS, 94.4050- 
4055,1997 



Rwgcl.L.etal.,J. 

Birf.Chefa264, 

21S22-2I528, 



Hunng, U. £t al^ 
J. Cell. Biol. 145, 
951-959,1999 



S'GarCTCCAGAAGAAGCTGGAGGAGCr" 
I AGAGCTTGATSAG 



138 



139 



Fukuda. (1997) 
X Blot Chem 



FIGURE 29C 
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I Sizeexdusion 



(AA SBQtA L Q K K L 
I I« D S 



B B L B 



140 



272.51.32642- 
3264a 



I PROJ domain of 
MAP4 



Vesicle 
membrane 



I SynaptobrevST 



5'CCXX5ACC IL'AG imCTGCATCCCTTGACA" 
GAACCACCTOCAGAAATTGACGGAGAAATAA 
AGCGAGACTTCATGGCTXSCCKrrOGAGCCAOA 
GOXTATOATGACATCGTCCKiACAAACTGTC 
OAOAAAACTGAGTTrATTOCTC rttTOOATDG 
TOATOAOAAAACCGGGAACTCAGAGTCCAAA 
AAGAAACCCTGOTAGACACTACCCAGGTra 
AAGGTATCCCATCTTCTAAACCAACACTCCTA 
GCCAATCOTQATCATGGAATGGAGCGGAATA 
ACACTCCACGCrrCTCCAACTCACTrCCrraAA 
OAGAGAGTGGACTATCCGGATTAirAGAGCA 
CCCAGAACTGGCCAGAAOATGCAAGCTnTC 
mcCAGCCTCAGCAAGTGrrAGATACTOACC 
AGGCTOAOCrCTrrAACGACCACCCTGAT^ 
Tf^°GmCGCAGATCrGCrCTrrGKT^ 
GACCt^CCAAaJCTrcTGCATTTACAOAGCG 
AGACAATCCTrCAGAAGACAGTTACGOTATO 

cnrarixnGACTrATntKrnrcACccc^ 

TGTVITCTC^ 

AACTCTOIATGrrCAGAGTCCTCTOTCKXXX: 

agaggtt>ctatagaaaccctacagcx:agca 

ACAGAGCTCTCCAAGCCAGCAGAAGTGGAAT 
CACTGAAACACCAGCTOCCAGCTAAAGCATr 
GGAAAa?ATOOCAGAGCAGACCACTOATOTO 
CTGCACTCTCCATCCACAOACACAACACCAO 
CCCCAGACACAOAGGCAGCACTCCCTAAAGA 
CATAGAAGAGATCACCAAGCCAGATCTOATA 
TTGGCAAATCTCACXjCAGCCATCTACroAAT 
CGGATATGrrcCrGGOCCAGGACATCCAACT 
ACTCACAGOAACAGAGGCAGCXrACCCTAAC 
AATATCATATTGCCTACAGAACCAGACGAAT 
CrrCAACCAAGGATGTAGCACCACCTATXKJA 
AGAAGAAAT7CTCCCAGGCAATCATA 

I (AASEQ:ADLSLVDALTEPPPEIEGEI 
KRDFMAALEAEPYDDIVGETVBICT 
EFIPLLDGDEKTGNSESKKKPCLD 
TSQVEGIPSSKPTLLANQDHOMEG 
NNTAGSPTDFLEERVDYPDVQSS 
QNWPEDASFCFQPQQVLDTDQAE 
PFNEHRDDGLADLLFVSSGPTNAS 
AFTERDNPSEDSYCMLPCDSFAST 
AVVSQEWSVGAPNSPCSESC VSP 
EVTIETLQPATELSICAAEVESVKEO 
LPAKALETMAEQTTDVVHSPSTDT 
TPGPDTEAALAICDIEEITKPDVILA 
NVTQPSTESDMFLAQDMELLTGTE 
AAHANNIILPTEPDESSTKDVAPPM 
EEEIVPGNDTTSPKETETtUPIICMD 
LAPPED VLLTICETELA PA KGMVSL 
SEIEEALAKN.DVRSAEIPVAOETV 
VSETEVVLATE V VLPS DPITTLTK 
?.Vi:^i:^^^'^'*^^^^MTPSLETEM 
I TLGKETAPPTETNLGMAKDMSPLP 
ESEVTLGKDVVILPETKVAEFNNV 
I TPLSEEEVTSVICDMSPSAETEAPL 
AKNADLHSGTELIVDNSMAPASDL 
ALPLETKVATVPIKDICn 

' ATGTGGGCaUlTCGGGATTACTGTTCT 



141 



142 



GGTTATCTTOlTCaiTCATCATCATC™ 
TCGGTTGTC 



143 



(AA SEQ: MWAIGITVLV I144 
I F I I I I I V W V V) 



West, (1991). J 
Biol Chem 
266(32): 21686- 
96: Olson. K. R. 
(1993). JCeO 
Biol 130(3): 639- 
50. 



Schlavoetal^ 
(1992) ^tettlre 
359,832-5 
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Veslda 
membfane 

Nudear Export 


WVltUUlBVUI 


5 ' ATGTGGGCGATAGGCSATCAGTGTCCT 
GGTGATCATTGTC!ATPaTY>A'rr»R -rn.ni.- . 
TGGTGTG 

(AA SEQ: MWAIGISVLV 

iiviiiivwc) 


145 
146 


MCMahonet aL 
Nature 36434&.* 
349: Marfln et at, 
J-CeOBklln 
press 


ferootisome 


MEK2 


5 ' GAU^ixiCAGAAGAAGCTGGAGGAGCT 
GGAACTTGACGEAG 

JASEQ: DLOKKLEEI.PT nc 


147 
148 


i. B»I.Chem. 

268:11435-11439, 

1993 






5'TCTAAACTG ' 

AA SEO: S K L 


149 . 
150 


Amciyetal^ 
BiocheniL J. 
336J67-37I 








(1998) 



M.c™tub«.es(MAP4,SEQmNO:,51 (Nucleic add); SEQ ID N0:,S2 i^.,^ 
MAP4: 

TVBK TBPI PLLD 
BSKK KPCL DTSO vbo 

=^ JJJSS S = 

= S^I^' = ~ ™^ 

TDFL EERV DYpn Vno „ 
^NBHRDDGLADl.Tr, „ 

TTTAACGAGCAC CX3TGATGATGGT TTOGCAGATCTr rlr,jLL S G P T 

™^ ~s ssss sss 

™c™, 
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s ~ 

HS^' ^«^«Lic:r l^LLL. IvZr^ 

=?b ss^sii 

TCACTCTCAC3AA ATAGAAGAGGCT ^IgScaLsLiT SatLtSgcLt Jr»«» Jo.™ 
AGTGACAQTCTT TATCTTrTrfva *^^^^™=eCTCT GCAGAAATACCT 

««wuiTClT TATCrrCTCCGA GACCGTTTCTTA CTACAAC3CGAGA CGTCTTTATGGA 



VAQE TVVS ETB V V 
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s~ ~ H= ~ 

= VTL GKDV VI LP ETvw 
GAAGT(»CTCrra (WCAAGQAOrro GTTATACTTCCA GA»Ir^«»™ E P N 
CTTCC^C CCCTTCCTGCC SJ^SJ^ SSSS^ 

»VTp LS BS BVTS Vkdm . 
AATGTOACTCCH CTTrOiaMOM GAOOTWlOTm OTrL^;.,.™ ' 

™«CT™==T C^CTC^^ SSSS 
= = = 

NLEQ KETP GSQp SEPp e 

SJS JSSS s= 



NKBP PPSP BKK. A K 



AKTS TSKA KTOp tqtb ^ 
GCAAAGACTTCA ACATCGAAAGCC AAAACACAGCCC APto™^ K Q P A 

=-™t<»^ ™T»ocr™« ssss 
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T T S 6 G 



^^^^^^^^^^^ 

°™*»»aac LiJocJc^^ toSct^!^ " " « ' i p 

ARDV KPKP I TEA T 

GCCAGAGACGTG AAGCCAAAGCCA ATTACAQAaor,. f*^^ ^ ^ K . R t S 

---- T^o, .SSS JSSS 

PSKP SSAP ALVB 

CCATCCAAGCCT TCATCTGCCCCA GCCCTcLJr^ L ^ ^ TPtv 

--o™«=. ^sss sss 

SKAT S P s T LVjcj 

-^ccc^ sss 

•^TLP KHPT STvm 

ACAACTCTGCCT AAGAGGCC31ACC AGCATCWVrrr-r ° ^ ^ ADVR 
«™«»CGaA TTCTCCGG^ ^^'^^ GCIGA1X3TCAAA 

WTACSTTCTGA CTCCCCTrXGGA CQACTACaGTTT 
Rmta KSAS a 

AGGATGACTGCT AAGTCTGCCTCA cJLJln-nlnn. "^^T TSAS 

~- SSSS SSiSS 

^^"^^^"^^TGAAGAGA AACACCAnrrrr ar-rX^^ PPAG MTST 

^''KPMSAPSBeo 

«»^°TCAAGCCX: ATGTCTOCACCT AGCCGCTt-rJr-r 1 A L S V D K K 

««uftflfcA CCXXX3AGAAAGA CACCTGTTCTTC 
CCCACTTCCaCT MGCCOdSmiC ?f-iinJ^«.f« VSRL ATTV 

™= sss sss 

^*PD LKSV RBir. „ » 

TCTGCCCCTGAC CTCAAGAGTGTT CGCTCnAAfiXm = MlRH 

«C™. 

sss^ 

^ «^TCTCTTTTTT TGTCTCCGTCGA TGGTGTCGACCC 

JAOCn^AA^CT LTCCA^Itin« L^CaScc^C S A Q K P 

-ooa™. ^^^^ ^^^^ 

^AQKV QIVSK»„ 

C^CmJGOAAA GTCCAOATAOTA TCOUVaL^ f^^^ H I q S K 

VHAAAAAGTG AGCTACAOTCAT ATTCAATCCAAG 
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ooACOAcccTTT cacc«:tatcat aggttttttcac tcoatx^^ x.Acn^;u^ 



KPGG GDV K IBS 



AAGCCTGGTGGA GOAaATGTCAAG ATTraa.n^»^ f . ' " ^ K B K A 

ccxcS 

QAKV 6SLD HVOH pb»« 

S= i~ 



Q B T S I * 
CAGGAGACAAGC ATCTAA 
CTCCnCTGTTCG TAOATT 
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[ Signj Fmtease Recognition 



Sequence 



Rcactant Tai^et 
Sequoice 



[ Signal: 



Sequence 



Protease Recognition 
Scqncttce 



Signal 1 J Reactant Target 



Seqneni 



ce 



Signal 



ftoduct Taiget 
Sequence 



Protease Rccoj 



'gnition 



^^cactant Taiget 



] 



FIGURE 3(r 
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FIGURE 31 
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/A?^* ^ transfeded wllh DEVI>caspase biosensor 
W Cete before stimulatian of apoptosis. (B) Another field of 
cells atosteroilattonwfflh 250 
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Signal 


Product Target 
Sequence 


Prote^ Recognttum 
SequcDce 


Reactant Target f 
Scqnence 1 









EGFP 


Nuclear 
Localization 
Signal 


Caspase-3 


Annexin 11(1-23) 




► 



33 
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5 



Signal 


I^roduct Taiget 
ScqocQce 


Protease Recognition 


JEl«actant Taiget 








Sequence 



EGFP 


Nucleolar 








Caspase-8 











y I h'l I II ft >i i| i^i^ mi 
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Signal 1 



Product Taiget 
Sequence 



Protease 

Recognttum 

Sequence 




EYFP 


Nuclear 
Localizatioa Site 


Caspas&.3 


ECFP 


Anncrin 11(1-23) 



2>f 
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C 

o 
< 

CO 



70 

60 

50 

40 

30 

20 

10 h 



0 

70 
60 
50 

40 

30 

20 

10 



373 cells 



Control 
TNF 



nisonpfdn 




IL1 

X'^P + AnieomwIn 
IL1 + Anisomycin 



IL 



SNB19 cells 



S Control 



IL1 + Aniaomyan 




NFKB 



pp38 



ievel(p<0.Ol). 
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SEQUENCE LISTING 

<110> Giuliano, Kenneth A. 
Kapur, Ravi 

<120> A System for Cell Based Screening 
<130> 97'022-L 

<140> To Be Assigned 
<14l> Filed Herewith 

<160> 180 

<170> Patentin Ver, 2.0 

<210> 1 
<2H> 1770 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (882) 

<220> 

<223> Description of Artificial Sequence- 
GFP-DEVD-Annexin II construct 

<400> 1 

^? ;s s z - - s! s I- - 

?5 S2 L-p f.t - - - - 
IS s= - z: s ?s :s - ?s s it: 

40 45 

Ss ?S I?3 f^^ f 3 ""^ 9tg ccc tgg ccc acc etc gtg acc acc 
Cys Thr Thr Gly Lye Leu Pro Val Pro Trp Pro Thr Leu Va! Thr Tte 

55 go 

^ 2^ s: - - ^ - 

85 90 95 

S tS Ph! f ^^'^ ^^'^ ^^'^ ^"^^ 3CC gag 

105 HQ 

"f s S2 ?s q ?s s? - - K - s g= 



96 



144 



192 



240 



288 



336 



384 



1 
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4B0 



528 



576 



K K 15 1?. IS Z S K S if^ SI 2 IS 

s= SI z s - - - - - ^= 

155 

s ?:? ^ SI 5! ?s s SI SI ;;i is s; irj si 

1« 

va? f f ^^'^ ''^'^ ^'^S ccc ate ggc gac qpc 

Val am Leu Asp His Tyr Gin Gin Asn Thr Pro He Sy 111 ^ 

180 f y 

Pro 2^ 2^ p^^: r T ^^'^ '^^^^ 

i.eu fro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
200 205 

SI SI SI III SI IS 1?. IS S SI ^? LS S IS ^ 

215 220 

va? ?^ Ii%^ ^ 2: I?? -3 

225 I-e" Tyr Lys Ser 

235 240 

i!j III s sii ig SI i?i - 1?" IS m m z SI i?j IS 

245 250 255 

SI IS m SI SI SI zi IS SI IS si s s 

260 265 270 

^ SI IS SI SI j?i SI 21 - - IS ig SI SI IS 

280 285 

^ Pro pS C '"'"^^'^^^ ^^^^'^^^"'^ 

290 

accggggtgg tgcccatcct ggtcgagctg gacggcgacg taaacggcca caagttcagc 972 
gtgtccggcg agggcgaggg cgatgccacc tacggcaagc tgaccctgaa gttcatctgc 1032 
accaccggca agctgcccgt gccctggccc accctcgtga ccaccctgac ctacggcgtg 1092 
cagtgcttca gccgctaccc cgaccacatg aagcagcacg acttcttcaa gtccgccatg 1152 
cccgaaggct acgtccagga gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc 1212 
cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc gcatcgagct gaagggcatc 1272 
gacttcaagg aggacggcaa catcctgggg cacaagctgg agtacaacta caacagccac 1332 
aacgtctata tcatggccga caagcagaag aacggcatca aggtgaactt caagatccgc 1392 



720 



768 



816 



864 



2 
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cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc 1452 
ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc 1512 
aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg 1572 
atcactctcg gcatggacga gctgtacaag tccggactca gatctggcgc cggcgctgga 1632 
gccggagctg gcgccggagc cgacgaggtg gacggcgccg gcgccgatga agtagatggc 1692 
gccatgtcta ctgtccacga aatcctgtgc aagctcagct tggagggtga tcattctaca 1752 
cccccaagtg cctattga 



1770 



<210> 2 
<211> 294 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
GPP-DEVD-Annexin II construct 

<400> 2 

Met val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
5 10 15 



Val Glu Leu Asp Gly Asp Val Aan Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

'^^ 80 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



85 



90 



95 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
lOO 105 110 



Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 



"5 140 



Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 



155 



160 



Gly lie Lys Val Asn Phe Lys He Arg His Asn lie Glu Asp Gly Ser 

170 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 



190 



3 
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pro val I.U Leu Pro Asp Asn His Ty. .eu Ser Thr Gin Ser Ala Leu 

200 205 

ser Ly. Asp Pro A.n Glu tys Arg Asp His Met Val Leu Leu Glu Phe 

2-^5 220 
val rtr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 

235 240 
Gly Leu ATS ser Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala 

255 

ASP Glu val ASP Gly Ala Gly Ala Asp Glu Val Asp Gly Ala Met Ser 

270 

Thr val His Glu Xle Leu Cys Lys Leu Ser Leu Glu Gly Asp His Ser 



280 285 



Thr Pro Pro Ser Ala Tyr 
290 



<210> 3 
<211> 2439 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (2436) 

<220> 



<223> Description of Artificial Sequence- 
EYFP-DEVD-MAPKDM construct 

<400> 3 

^ ?3 ttl l^. S2 - - s IS ?3 52 1- ill - " 

10 15 

s 22 s z; f.i s= s - - - - z - 

25 3Q 

IS i| m s s s ^ s - s tli 

60 

IS =^ ^ s ^ s: s? 

■'S 80 

IS K 2^ S Z I" - - 

90 35 
cgc ace ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc 



144 



192 



240 



288 



gag 336 



4 
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Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He 5lu Leu Lys tly 
1" 120 125 

Til V 11" f®^ ^^"^ ^^'^ ''«=9 999 aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His LyI Leu llu 1^ 
130 135 140 ^ 

aac tac aac age cac aac gtc tat ate atg gee gae aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Lp Lys Gl? Lys J^n 
"° 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gae ggc aqc 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Lp Sy Ser 
165 170 175 

ri^ T'' f^'' ''t^ ""^^ °*9 aac aec ecc ate ggc gac ggc 576 
val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Lp 
180 185 190 

Pro 111 T*"^ r^^ ^^'^ ^'^'^ =''=9 age tae eag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser La Leu 
135 200 205 

age aaa gae cce aac gag aag ege gat cac atg gtc ctg ctg gag ttc 672 
ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu pSe 

215 220 

v2 Th^ f?*' ^f"" '^''^ ^"=9 9ac gag ctg tac aag aag 720 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 
230 235 240 

gga gac gaa gtg gac gga gee gae etc agt ett gtg gat geg ttq aea 76fl 
Gly Asp Glu val Asp Gly Ala Asp Leu Ser Leu Vai Lp Wa Leu ^ 
245 250 255 

gaa cca cet eea gaa att gag gga gaa ata aag ega gac ttc atg get 816 
Glu Pro Pro Pro Glu He Glu Gly Glu He Lys A^g Lp Phe Me? Sa 
2«0 265 270 

geg ctg gag gca gag cec tat gat gac ate gtg gga gaa act gtg gag 
Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val ciy llu Thr Vaf Glu 
275 280 285 

^- TK^ IV" ^^"^ '=*^9 gat ggt gat gag aaa ace ggg aac 

Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr lly Asn 

295 300 

lit ni?, 0''° f** *^9C tta gac act age cag gtt gaa ggt 

ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Slu IS 



864 



912 



960 



310 315 



320 



ate cca tet tct aaa cca aea etc eta gee aat ggt gat eat gga atq 
He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn liy Lp His tly nil 
325 330 335 

rif. n^^ ^^'^ 3ca ggg tct cca act gac ttc ett gaa gag aga 
Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu 111 ^ 



1008 



1056 



5 



1152 



1200 
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340 

350 

365 

S J£ ™ S» ^ j« ,„ e., 

370 ^"-"^ ^^-^ Asp Thr Asp Gin Ala Glu 

""^^ 380 

p s; s s L-j K Ty If- L" 

385 ^-^y I-eu Ala Asp Leu Leu Phe Val 

400 

tec agt gga ccc acg aac get tct t-*.^ 

S» 0., ^? - - t« J. cc. .... 

415 

s s .'2 ^= s 2: s - r r= - 

420 .->c ^ ^® Ser Thr 

430 

get gtt gta tct cag gaq too tct m-r. « 
Ala val val Ser Gin Ilu S tTr Va! g^^ lit T 

435 ^ 44 Q ^"^y Ala Pro Asn Ser Pro Cys 

445 

l« S? I~ f.t - I- J« - «. |a. a„ ct. c.e cc. 

450 ^^-^ lie Glu Thr Leu Gin Pro 

460 

gca aca gag etc tec aag gca aca aaa 

«. .1. ^„ s., s: 2?- ?3 ii: - - ij. eg 

480 

S K S LI S S IS =S S S S 

495 

S? IZ ?S 3f cca .ac aca gca 

500 ^ „^ P'^o A«P Thr Glu Ala 

^"^ 510 

s: IS us IS s: si s ?s r =^ 

515 „„ Pi^o Asp Val He Leu 

525 

^; ^a? ?s r ^^'^ 9- 

530 ^ Ser Asp Met Phe Leu La Gin 

'•"^ 540 

Z M^f Le" tl ?S Zl T r ^^'^ 

545 ^•'y ^ Aia Ala His Ala Asn Asn He 

55P 

- is s I s ir: s:^ s s "i l'S 

^''^ 575 

S^S?"?So^S?SSS^L':iS?S?SL-S.^ 



1296 



1344 



1392 



1440 



1536 



1584 



1632 



1680 
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2064 



2112 



I" ?s g ?s s 2: ^: s: s - s ?s s - 

^^'^ 605 

^. ?s - - s ^ s 1?= «? 

^■^^ 620 

1 1" ^ ir: is is e s s - - s 

640 

K s: s ?2 s; ^ ?:? - - - 

655 

va? Si 22 s ir: p^:: r ^ aca ao^^ 

660 ^ He Thr Thr 

670 

ttg aca aag gat gtg aca etc ccc 

^ ^ 2S S- s SI S pS S 

685 

?s s° ?s - - 1^ ct. 

690 ^■'■^ "^^ I^eu Gly Lys Glu 

700 

aca get cca ccc aca aaa aca aa<- 

M. .„ ?s s - - ^ «| - 

"'^^ 720 

2: s« S 22 T r ^" 

725 val He Leu 

735 

s ^ ;s - - ?s j:: 2: is 
I" IS |; ?s L^j ?it i s - s s SI 

765 

z s L" ni s IS s I?; SI - - ;s 

780 

9tg gac aac age ata act cm ..^ 

val ASP Asn sL Me? Il'a p" Ii": S 2^ T ^"«> 
785 Asp Leu Ala Leu Pro Leu Glu 

800 

z fA s IS ;s si iii si 



2208 



2256 



2304 



2352 



<210> 4 
<211> 812 
<212> PRT 
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<213> Artificial Sequence 



<220> 
<223> 



Description of Artificial Sequence- 
EYPP-DEVD-MAPKDM construct 



<:400> 4 



«=t «1 Uy, Oly 3a„ L.„ «„ ™, ^ ^ 

-.1 «u oly val »„ cly „. L,. Pt. VjJ s.^ oa, 

=lu ai, clu ,=1, ^ ^ ^„ ^ 

<V. ^ CI, .y. p P„ p„ Z ^ 

60 

«V T,. CI, ^ 01. C= P^ ^ ^ „^ 

''^ 80 
01. «i. ^ p.. p.. ^ ^ 

X., u. Pj. p.. ^ ^ ^ ^ ^ 2 „a. 

110 

v.. .y. P., ex. cix ^ ... ^ 

"e «p p.e Ly. CI. 01, xi. oi, HI. Z ^ 01. T„ 

140 

^ ser His Val ryr He Met Ma .sp .y. oin X.ys 

160 

Gly lie Lys Val Asn Phe Lvs ti« t,- , 

ne Lys He Arg Hx^ Asn He Glu Asp Gly Ser 

175 

V.1 01. 1.. „. „. ^ ^ ^ 

190 

P~ V.1 J.. p„ ^ ... s., ^ 01. s., „. x^ 

205 

«er .y. p,. ^ 

^■^^ 220 
v.. ,^ „. .1. 01, ^ ^„ 01. 1^ 1.,. 

240 

ci. v.1 01, „. ^p ^„ ^ ^. ^ 

255 

«. P~ P™ P^ 01. „e 01. 01, 01. .1. ^ ^„ ^ ^ 

270 

«. L» 01. «. 01. Pro ^ ;„p „. „^ ^ _ 



e 
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Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn 

Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly 
"° 315 

He Pro ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met 

330 335 

Glu Gly Asn A.n Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg 

345 350 

val ASP Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala 



360 



365 



ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu 

^'^ 380 

Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val 



395 



400 



ser ser Gly Pro Thr Asn Ala Ser Ala Phe Glu Arg Asp Asn Pro 
"5 410 415 

ser Glu ASP ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Ttr 

Ala val val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys 

440 445 

ser Glu ser Cys Val Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro 



460 



Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin 

475 480 

Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin ThT Thr Asp Val 
*"5 430 ..r 



495 



val His ser Pro Ser Thr Asp Thr rbx Pro Gly Pro Asp Thr Glu Ala 

505 510 

Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He Leu 

520 

Ala Asn val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin 
ASP Met Glu Leu Leu Thr Gly Thr Glu Ala Ala His Ala Asn Asn He 



lie Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val 

565 



570 



Ala Pro 



575 

Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys 

585 550 

Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu 

600 605 
Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly 



Met 
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Val ser X.eu Ser Glu XXe Glu Olu Ma X.eu Ala .ys val 

"5 640 
0.. „. ^ ^ 

655 

Val Val Leu Ala Thr Glu Val Vai t-.» « 

660 ""^^ Asp Pro He Thr Thr 

670 

685 

J.P ™, P„ g ^ ^ ^ 

''^^ 700 

^ P.O «o ^ cj„ ^„ 

''•'^^ 720 

Pro Leu Pro Glu Ser Glu Val Th*. t 

^er Glu val Thr Leu Gly Lys Asp Val Val lie Leu 

735 

Pro 0I„ ™, ,^ ^ ^ 

750 

Cl. 01„ V.X V.X .y. „ „^ ^ 

765 

Ala Pro I.U Ala .y. Asn Ala Asp .eu His Se. Cly ^ oiu X,eu lie 

780 

Val Asp Asn Ser Met Ala Pro Ala s**r x 

785 ''''^ Asp Leu Ala Leu Pro Leu Glu 

"'^^ 800 
Thr Lys Val Ala Val Pro lie Lys Asp Lys Gly 



810 



<210> 5 
<211> 2439 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(2436) 

<220> 

^^^^^ ^^"^^'^^'^ Artificial sequence. 
Eypp-DKAD-MAPKDM Construct ^ 



<400> 5 

S SI Z IS S S IS - - 1- - « 



Pro He Leu 
10 15 



?.1 S L-^ - - - - - - 2? " 

30 



10 
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288 



336 



110 

ggc 384 



?S I?; m Z SI ^'^r.^.^.'^ - J- JL= il: - 

40 45 

s z s ^^'^ 

50 jr ir-ro vai Pro Trp Pro Thr Leu Val Thr Thr 

55 60 

s i!j ^ s; s S 5^ L-^ S= ^1 - - 

80 

s s: K s ^' s: s - - - si ij; 

S ^= SI IS S L'^ ^ S S S 
5f ?S 53 ^: 2^ - - 

125 

g in L- III IS SI S IK ^ 

^•^5 140 

s^^^ H i^i ti: ^11 !ii r - 

145 ^ ^^'^ Asn 

160 

s ni SI s? E£: - -I - SI SI ?ji is m sii 

170 3^75 

SI 21 S SI SI SI SI SI SI III f L= IS - - 

190 

m ?s 21 s SI SI SI SI SI III SI SI ^11 SI s 

2"" 205 

s ™ SI m SI SI :;i SI si si ;ii ;s i" "-^ 

210 of^ ^ ^eu Glu Phe 

^•'^^ 220 

val ^ i^: i,^: i?^ ^ o^?j ^.^^ r i?^ 

225 ^ '^■^y A«P Gl« Leu Tyr Lys Pro 

"° 235 240 

5 SI I" SI SI SI SI SI 21 SI 21 C SI 21 SI 

^50 255 

I" SI 12 III IS S SI - 1- - - ^ ^„ 

270 

gcg ctg gag gca gag ccc tat gat gac ate gtg gga gaa act gtg gag 



432 



480 



528 



720 



768 



816 
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Ala Leu Glu Ma GIu Pro Tyr Jap He Val Oly ciu Thr Val oiu 

285 

^ ^ SI z iv, 2: a s z til r "' 

290 I-ys Thr Gly Asn 

300 

tea gag tec aaa aag aaa ccc tan i-i-a 

s=, ^ .4 s L': s s - s "° 

320 

s s K :^ ^ s 2; s l-s - - s i'l 

335 

gag ggg aat aac act gca ggg tct cca ant- 

cl, ^, »^ - - - ^. 

350 

?3 L- I- - - .| IS L- s s- s s: 

365 

sir JS^ III - ^2 IS ^ f'^ '^^^ 

370 ^ ^^-^ Thr Asp Gin Ala Glu 

380 

^ IJS S L"^ oT, LI S: L" r= 

385 ^gn ^ ^ ^ I'SU Phe Val 

395 

400 

1!- s S LI IS IS s s s 

s K= ?s LI - - - - - 2 ^ 

430 

ii" s.^ s^^j oil ii^^ g -J m r r '^^^ 

435 ^Jn ^ ^ Cys 

445 

s - s I?; ?s s ?s s s 

460 

?s s IS s* IS ?3 r r - 

465 Val Glu Ser Val Lys Glu Gin 

LI ^ S SI 2S S L'f it: is si ?s S - - 

490 

gtg cac tct cca tec aca aac aca ar-a 

v.. Hi. 3., se, ™. i:^ ?s ?s i!= ss IS s IS IS 

510 

s SI SI L- ;s si ;s s is s si ?s si 



1104 



1152 



1200 



1248 



1392 



1440 



1488 



1536 



12 



1728 



1776 
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520 

s: v-r ?s ^ ^ K - - - IS ;2 js: - s 

540 

^ s iL- 2: ^ ?s s- s s i:: t. 

"° 555 S60 

;s s s s: s° s s ?s z; ?s s ^ 

^''^ 575 

Si ^: r r*^ ^'^^ 

580 fir T^'^ Thr Ser Pro Lys 

iiu' ^ p^^: III - cot .e24 

595 ^^"^ I-ys Met Asp Leu Ala Pro Pro Glu 

605 

IS s L-: 2: s° 2- ^ r s: K s: s - - - 

620 

VaJ s"r 2u Se" ll^^ I^u^ f," ^ '^^'^ 

625 Ala Lys Asn Asp Val Arg 

640 

it: oS s 53 ;3 - is ?s 

"0 655 

Si ii^: ^ iTu Si f ^ r '^'^^ 

660 ^^'^^ *«P lie Thr Thr 

665 

ttg aca aag gat gtg aca etc ccc tta 
Leu Thr Lys Asp Val Thr Leu Pro Leu 
«'5 680 

s 2p Vri a IS ?s ;i ^ 

690 " Thr heu Gly Lys Glu 

700 

s s ?s ^ s 5 ?s j^: - - ^1 1^^^^ ^« 
s j:: is s-. s s: ;3 21 q - - «• 5 

''^^ 735 

S 5^ S? tit S2 L-s s ^ - - - 

'^^^ 750 
gaa gag gta acc tea gtc aag qac ato ^rl- r.^« ^ ^ 

al« 0l„ „„ se, ^1 .vl Sp' S? IS S IS ^ S2 



1968 



2016 



670 

s ?^ LI s? 2u pr: 2:: ii^ r '^'^^ 

675 ^9 I^eu Val 

685 

2112 

I*eu Gly Lvs Glu 

700 



2160 



2208 



2304 



765 
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get ccc ctg get aag aat get gat ctg cac tea gga aca gag etg att 2352 
Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu He 
770 775 780 

9tg gac aac age atg get eea gee tec gat ett gca ctg ccc ttg gaa 2400 
Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu 
785 790 795 800 

aca aaa gta gca aca gtt cca att aaa gac aaa gga tga 2439 
Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly 
805 810 



<210> 6 
<211> 812 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
EYFP-DEAD-MAPKDM construct 

<400> 6 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
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195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Pro 
225 230 235 240 

Arg Asp Glu Ala Asp Ser Ala Asp Leu Ser Leu Val Asp Ala Leu Thr 
245 250 255 

Glu Pro Pro Pro Glu He Glu Gly Glu He Lys Arg Asp Phe Met Ala 
260 265 270 

Ala Leu Glu Ala Glu Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu 
275 280 285 

Lys Thr Glu Phe He Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn 
290 295 300 

Ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly 
305 310 315 320 

He Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met 
325 330 335 

Glu Gly Asn Asn Thr Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg 
340 345 350 

Val Asp Tyr Pro Asp Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala 
355 360 365 

Ser Phe Cys Phe Gin Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu 
370 375 380 

Pro Phe Asn Glu His Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val 
385 390 395 400 

Ser Ser Gly Pro Thr Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro 
405 410 415 

Ser Glu Asp Ser Tyr Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr 
420 425 430 

Ala Val Val Ser Gin Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys 
435 440 445 

Ser Glu Ser Cys Val Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro 
450 455 460 

Ala Thr Glu Leu Ser Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin 
465 470 475 480 

Leu Pro Ala Lys Ala Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val 
485 490 495 

Val His Ser Pro Ser Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala 
500 505 510 

Ala Leu Ala Lys Asp He Glu Glu He Thr Lys Pro Asp Val He Leu 
515 520 525 
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Ala Asn Val Thr Gin Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin 

535 540 

Asp Met Glu Leu Leu Thr Qly Thr Glu Ala Ala His Ala Asn Asn He 
550 555 

He Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro 
565 570 

Pro Met Glu Glu Glu He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys 
580 585 590 

Glu Thr Glu Thr Thr Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu 
595 600 605 

Asp Val Leu Leu Thr Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met 

615 620 

Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg 
"0 635 640 

Ser Ala Glu He Pro Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu 
645 650 655 

Val Val Leu Ala Thr Glu Val Val Leu Pro Ser Asp Pro He Thr Thr 

665 670 

Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val 
675 680 685 

Thr Asp Met Thr Pro Ser Leu Glu Thr Glu Met Thr Leu Qly Lys Glu 
690 695 700 

Thr Ala Pro Pro Thr Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser 
'10 715 720 

Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys Asp Val Val He Leu 
■^25 730 735 

Pro Glu Thr Lys Val Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu 
'40 745 750 

Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu 
755 760 765 

Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu He 
"° 775 790 



Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu 
'90 795 

Thr Lys Val Ala Thr Val Pro He Lys Asp Lys Gly 
805 810 



<210> 7 
<:211> 864 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<221> CDS 

<222> (1)..(861) 

<220> 

cSSct™ ^--J-ence: P25-MEK1 

<400> 7 



s^? s ^ IS jj: - - is 
s: s S5 1:^ l-^ - - s 

30 

IS I!J - - - S ^ s- - 

tgc act act ggc aaa ctg cct qtt cca ^flr, 

^ ^ ^ tit '^^tii^ s ?s s 

60 

ctg tgc tat ggt gtt caa tac t-t-*- t-^^ 

- ^ .'S ^ I- - - 

80 

s s.' js: 1= SI - IS 15 v-L- SI IS 

^° 95 
agg acc ate ttc ttc aaa gat gac qoc aan i-.^ 

^ n. Lp Lp° ^ S^l ?S S IS 

110 

gtc aag ttt gaa ggt gat acc ctt gtt aat an, »^ 
V.I .y. Pje ^ ^ - „. 

125 

Xl" I^p JL' 1^. III - aaa ttg gaa tac 

130 ^ Sly His Lys Leu Glu Tyr 

140 

aac tat aac tea cac aat gta tac afr- 
Asn Tyr Asn Ser His Asn Val ^ nl Met f 

145 ^ lie "et Ala Asp Lye Gin Lys Aen 

160 

39* ate aaa gtq aac ttr- aan 

Oly lie Lys U r^l Lys ^ His' L^n II" ^ ^^'^ 

y xar Arg His Asn lie Glu Asp Gly ser 

"° 175 

SJcSSS;!^s:5SlSIS^-S:Sl5I^U^ 

190 

cct gtc ctt tta cca gac aac cat tan n^.« 

- v.. ^„ ^ p„ Lp - s s ?s IS i« s: s 

'^"^ 205 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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teg aaa gat ccc aac gaa aag aga gac cac atg qtc ctt cth 
Ser Lys Asp Pro Asn Qlu Ly Arg Asp His Me? Si 2u Ilu pj^ 

215 220 

E s s ?s If? :s is s ^ 

235 240 

r?! m"? f^^ ^'^S ate cag ctg aac ccg gcc ccc 

Gly Met Pro Lys Lye Lys Pro Thr. Pro He Gin Leu Asn Pro Sa 5ro 
245 2S0 255 

Z 2^ S L- 1^ ?S SMS IS IS - 

265 270 

S SS J-1 2^ 2S IS SI IS IS - 

280 



<210> 8 
<211> 287 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: F25-MEK1 
construct 

<400> 8 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

5 IQ 



15 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 ' 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 60 

Leu Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pxx, Asp His Met Lys 

'° 75 eo 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
"0 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

150 



720 



758 



160 
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aiy lie Lys val Asn Phe Lys Thr Axg His Asn He 31u Asp aiy Ser 
1« 170 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser llu: Gin Ser Ala Leu 

205 

Ser Lys Asp Pro Asn Glu Lys Arq Asd His m*.*- w=i t 

210 one ^ <31u Phe 

^•^^ 220 

val Thr Ala Ala Gly He Thr His Gly «et Asp Glu Leu Tyr Asn Thr 

"° 235 
Gly Met Pro Lys Lys Lys Pro Thr Pro He Gin Leu Asn Pro Ala Pro 

250 255 
Asp Gly ser Ala Val Asn Gly Thr Ser Ser Ala Glu Thr Asn Leu Glu 

270 

Ala Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu Gin Gin 



280 



<210> 9 
<211> 876 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (873) 

<220> 

<223> Description of Artificial Sequence: F25.MEK2 
construct 



s til '£ '". ^ IS I- ^ - s s L-: - 

^ 15 

K m ti L-s m s - - 

IS s 15 s: ^= ?s s ^= sj: s - 

40 45 

?s s - - s:: - ^ 

55 60 

''S 80 

SI s s ij; - ?s is 
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85 



90 55 



S ?S S 5£ Z - - ^ - - ^» ^ 

I i!5 z ^ 2; f.' s: - - 

125 

- g z s I ?s -I LI - 

140 

aac tat aac tea cac aat ata tan ^t-^ 

Asn Tyr Asn Ser His Asn Val iS m2 f?" f ^ f 

145 150 ^ ^-^^ Ala Asp Lys Gin Lys Asn 

160 

0-?^ s? s s s: - - - 

170 175 

1^ SI s- ^" Si ?s ?s s - - - 

190 

^ 2: z; li is s m 

^"'^ 205 

s ^ ^ ^ 1^ - 

210 y« /urg Asp HIS Met Val Leu Leu Glu Phe 

"^^^ 220 

S S K m ^ ?S 1!^ - Zl !K 2! - 

"° 235 240 

2S ;=i s I s ?2 s s IS 2: l- 

255 

s s s: s ?s is i!^ s 

s S! 23 SI s S3 S3 - S s: 11= 



280 285 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



gag cag cag taa 

Glu Gin Gin 876 
290 



<210> 10 
<211> 291 
<212> PRT 

<213> Artificial Sequence 
<220> 



20 



wo 00/50872 

PCT/USOO/04794 

SL'tScJ"'* ^"^"-1 ^--J-ence: F2S-MEK2 
<400> 10 



Met Ala ser Lys Gly Glu Glu Leu Phe 1^ ciy Val Val Pro He Leu 

Val Glu Leu A.p Gly Asp Val Asa Gly His Lys Phe Ser Val Ser Gly 

2^ 30 
Glu Gly Glu Gly Asp Ala Xhr ^ Gly Lys Leu ^ Leu Lys P.e lie 

40 45 
Cys ^ ^ Cly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 

60 

I-u Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

Arg Thr Xle Phe Phe Lys Asp Asp Gly Asn ryr Lys Arg Al^! Glu 

110 

val Lys Phe Glu Cly Asp xhr r.u Val Asn Arg xle Glu Leu Lys Gly 

125 

Xle ASP Phe Lys Glu Asp Gly Asn Xle Leu Gly His Lys Leu Glu .yr 

140 

Asn Tyr Asn Ser His Asn Val Tvr He m*.*- at * 

145 Ala Asp Lys Gin Lys Asn 

Gly lie Lys val Asn Phe Lys Thr Arg His Asn Xle Glu Asp Gly Ser 

170 

val Gin Leu Ala Asp His Tyr Gin Gin As, Thr Pro Xle Gly Asp Gly 

190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 
Ser Lys Asp Pro Asn Glu Lys Arq Asd hIb m^*- tr^i t 

220 

val Thr Ala Ala Gly Xle Thr His Gly Met Asp Glu Leu Tyr Asn ^ 

240 

Gly Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr He Asn 

255 

Pro Xle Ala Glu Gly Pro Ser Pro Thr Ser Glu Gly Ala Ser Glu 

270 

Ma Asn Leu Val Asp Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp 



2«0 285 



Glu Gin Gin 
290 



21 



wo 00/50872 

PCT/USOO/04794 

<210> H 
<211> 889 
<212> WA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (888) 

<220> 

<223> Description of Artificial Sequence: Caspase 
3-DEVD-substrate construct ^^spase 

<400> 11 

IS 5: ^ ?s - - s z ?s tii - « 

1° 15 

IS s IS ?s | s: si - - ;s - - 
IS i!j s z s £* s ^ m z s: ?s si jj: ;n - 

60 

z I- g: jj: - - - - - 1« ~ 

80 

s L" jj: jl= -^^ zi ir, ;s si s: 
s?s^s:l=S5:isisi5s:^-?£-ss: 

110 

S ?s 2S ^- si - ir, 

125 

n.'s^s:;is:isg^si'S2ii5s:j^j:is:s? 

140 

^ L"s L': LI lit ^ s r r t 

145 ^-^^ Ala Asp Lys Gin Lys Asn 

160 

I?; ™ S! ^ - - s: SI s s: zi ^ z 

^"^^ 175 

?S SI S S ^ SI ^ SI SI III s ?s - - - 



192 



240 



288 



336 



384 



432 



480 



528 



22 



wo 00/50872 

PCTAJSOO/04794 

180 les 

r^^ ''''^ 3^*= «tg tec aca caa tot gcc ctt fi7d 

Pro val l^u Leu Pro Aap Asn His Tyr Leu Ser Thr Gin Se^ Ma 



205 



III rt I l"'' 9*'= gtc ctt ctt gag ttt 

ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Pte 

215 220 



255 



val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 



55 



60 



Leu cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



75 



Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
85 



90 



80 



Glu 



95 



Ar^ Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 



672 



720 



gta aca get get ggg att aca cat ggc atg gat aaa eta tan 
val Thr Ala Ala Gly lie Thr His Sy „e? I^p HI ^ 

"° 235 240 

gga aga agg aaa cga caa aag cga tea act ott aaa 
Gly Arg Arg Lys Arg Gin Ly^ A^g sel S lyl HI g?J 
245 250 



I?. I f.i zi Ty K ?s IS s - 

265 270 

IT. s s St s - - - - s S3 s zi 

275 280 285 

tct aca ccc cca agt acc egg ate c 
Ser Thr Pro Pro Ser Thr Arg lie 
290 295 



<210> 12 
<211> 296 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase 
3-DEVD-substrate construct 

<400> 12 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
5 10 15 



23 



wo 00/50872 



PCT/USOO/04794 



Val Lys Phe GIu Gly Asp Thr teu Val Asn Arg He Glu Leu Lys Gly 
115 ^ y 

He ABp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu 



135 140 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys 



Glu Tyr 



Asn 
160 



Gly Ser 
175 



150 155 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp 
165 170 ^ 

val Gin Leu Ala Asp His oyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 i9(j 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 



215 220 
Val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr 



230 



235 



Asn Ser 
240 



Gly Arg Arg Lye Arg Gin Lys Arg Ser Ala Val Lys Ser Glu Gly Lys 



250 255 



Met 



Arg Lys Cys Asp Glu Val Asp Gly He Asp Glu Val Ala Ser Thr 
260 265 270 

Ser Thr Val His Glu He Leu Cys Lys Leu Ser Leu Glu Gly Val His 
275 280 



285 



Ser Thr Pro Pro Ser Thr Arg He 
290 295 



<210> 13 
<211> 846 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(846) 

<220> 

<223> Description of Artificial Sequence: Caspase 
6-VEID-substrate construct 

<400> 13 

atg get age aaa gga gaa gaa etc ttc act gga gtt gte cca att ctt a ft 
Met Ala Ser Lys Gly Glu Glu Leu Phe Thr % VaJ Val Pro Se Le^ 
^ 10 15 

Va^ r"^ T"^ '^'^ ttc tct gtc agt gga 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

2° 25 30 

gag ggt gaa ggt gat gca aca tac gga aaa ctt ace ctg aag ttc ate 144 



96 



24 



wo 00/50872 

PCT/US00y04794 

Olu Gly Glu Gly Asp Ala Tyr Gly Lys Leu Thr Leu Lys Phe He 

45 

^= ^ S^-^" S IS ?5 IS ?S - ?S S S - 

i Z z 12 - ;s ^ 

''^ 80 

egg cat gac ttt ttc aaq aat acc 

^ .i. ^ ^ .,1 £ Zl '£ Hi ^ 1" 

90 

SI ?s s s IS K ^ ^ 5! - - 
isj r^z-z^^i iii ?s - ™ - 

120 2^25 

g is: s j:: s: zi if- is 2:: - 



240 



288 



336 



384 



"5 



ti: III L^: Si i^: r r 

145 150 ^ '^^^ 

160 



o^J 'It I- SI III - ^ - so .« I" SI 2?- S 

170 175 

?5 ir. 21 SI S SI - SI is; si - III SI - - Z 

185 190 

J~ III HI III SI SI SI III - III ?ii SI III SI III 

205 

s III SI m SI 111 is m si 11? ?ii 111 m - - 

^■'•^ 220 

s I?? ?s c^?3 ^2 r r '^'^^ "^^^ 

225 ^ „„ "^'^ A«P Glu Leu Tyr Aan Ser 

"° 235 240 

Tr^tS III S IS SI SI 12 ?|| SI SI tl SI SI SI 

250 255 

S ?ll 11? I£ IS SI SI SI III IS g| Si Si HI SI 



528 



576 



624 



672 



720 



768 



816 



270 



gaa gga gta cac agt aca cca cca age gca 
Glu Gly val His Ser thr Pro Pro Ser Ila 



25 



wo 00/50872 

PCT/USOO/04794 

^'^5 280 

<210> 14 
<211> 282 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caapase 
6-VEID-aubstrate construct 

<400> 14 

!er Lys Glv Glu Glu dk** mu- -^i- ^ „ , _ 

e Leu 



Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro II 
5 10 15 

val Glu Leu fl«p Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

^5 60 
I^u Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
^° " 80 

Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

105 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

lie ASP Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

ABn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

155 160 

Gly He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
1« 170 175 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

185 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 



200 



205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn Ser 
"° 235 240 

Gly Arg Arg Lys Arg Gin Lys Aig Ser Thr Arg Leu Val Glu He Asp 
245 250 



255 



26 



wo 00/50872 

PCT/USOO/04794 

Asn Ser Thr Met Ser Thr Val His ri,» r^ r ^ 
260 



270 



Glu Gly Val His Ser Thr Pro Pro Ser Ala 
275 280 

<210> 15 
<211> 876 
<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 
<222> (1) . , (876) 

<220> 

^'"^ SsSct*'" «e^ence: Caspase a-VHTO 

<400> 15 

gc aaa gga gaa qaa etc a/.*- _^ 

48 



30 

144 
192 



S s I" s If- - ?s ^ 

?S IS - - - - .-^ £ s 

Z ij: IS s ?s s: 

55 go 

s SI is cT, s- - ^ - - 

''^ 80 

s s.' z; - s - ^ - - 

110 

?s :s s z ?s p - ^ - - - z 

120 125 

Xl" IS. I?. LI S - tt. ^aa tac 

130 ^ ° Hia Lys Leu Glu Tyr 

140 

L^n L^n sS ^[^r Z Te M^? T T ''^^ 

145 ^ ^-^^ Ala Asp Lys Gin Lys Asn 



240 



288 



336 



384 



432 



160 



27 



wo 00/50872 

PCT/USOO/04794 



Val Glu Leu Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

Glu Gly Glu Gly Asp Ala Thr Tvr fiW t„» , ™^ 

F ^ Thr Leu Lys Phe lie 

*0 45 

Cys 0^ Thr Gly Lys Leu .^o Val Pro Trp Pro Thr Leu Val Thr 01. 
I-eu Cys ^ val Gl„ cys Phe Ser Arg ryr Pro Asp His Met Lys 



576 



624 



672 



tt zt ?s s I" zt m - 

-^'^'^ 175 

Si oL- L': s: ti; si ^ x s- zi ?s ?s i!« - 

205 

s z IS s s s s:? f r - 

210 ^ Val Leu Leu Glu Phe 

^•^^ 220 

Si ?s m ^?y t^t r ^^-^ -0 

225 ' 230 ^ *°P Tyr Asn Ser 

240 

99a aga age aaa cga caa aag cqa tea ta^ 

oiy ^ s=, ^ 4, - e« ,„ „. 

255 

s s: s ?s s s HI s - 

270 

i" s: s T. £s .t. - s s - 

285 

agt gcc gga tec 

Ser Ala Gly Ser 876 
290 



<210> 16 
<2H> 292 
<212> PRT 

<213> Artificial Sequence 
<220> 

<400> 16 

Met Ala ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

10 15 



768 



816 



80 



28 



wo 00/50872 



PCr/USOO/04794 



Arg His Aap Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



90 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 



105 



95 



Ala Glu 



110 



Val Lys P.e Olu Gly Asp Ttr I.u Val Asn Arg He Glu I.u Lys Gly 

125 

lie ^ P,.e Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 



140 



ABn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys 

" 150 



155 



Gin Lys Asn 
160 



Gly lie Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 



170 



175 



val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

185 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 



200 



205 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 



210 



215 



Val Leu Leu Glu Phe 
220 



val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu Tyr Asn Ser 



235 



Gly Arg ser Lys Arg Gin Lys Arg Ser Tyr Glu Lys Gly 
245 



250 



240 



He Pro Val 
255 



Glu Thr Asp ser Glu Glu Gin Ala Tyr Ser Thr Met Ser Thr Val His 

270 

Glu lie Leu Cys Lys Leu Ser Leu Glu Gly Val His Ser Thr Pro Pro 



275 

Ser Ala Gly Ser 
290 



260 



285 



<210> 17 
<211> 906 
<212> DMA 

<213> Artificial Sequence 

<220> 

<221> 035 

<222> (1)..(906) 



<220> 
<400> 17 

^? lit s ^. ^ It: s til ;s is s " 



10 



15 



29 



144 



192 



240 



288 



336 



384 



WO 00/50872 

PCT/USOO/04794 

?s Si ^ s K z ^ m ^ jjs tt s " 
s: s IS ?s o^- s s ^ ^s: 

40 45 

i;: pro r ^'^'^ 

6iy Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

60 

i X S; s ^2 - - LI s:? 

''^ 80 

s s - =^ - - s| 

15 ?s ^= - - - - ~= ^c. ^ 

110 

?s IS L" ;s ?s s: s ?s is is 

^2** 125 

L'^ IS K I S - - Z S IS 

"'^•^^ 140 

«9 |ca ,ac aaa caa aa. aat 
145 ^ Ala Asp Lys Gin Lys Asn 

160 

S -S S? L": L= J- s= I" r .s. 

^■'^ 175 

va" X K ^ r T '^'^ " 

iBo ° ^ <5iy *«P Gly 

s s»' ^ - ™ - - - 

205 

ts^^'^ til IS s ss ;:f ?s r 

210 oi^ ^ Glu Phe 

^■^5 220 

val s s: I?? Ill Ts III zi r T ^^'^ -° 

225 ^ Asp Glu Leu Tyr Asn Ser 

"° 235 240 

l?;S^S.--SISS?S,-SL^oT,SJIS?SlSS 



480 



528 



30 



wo 00/50872 

PCT/USOO/04794 

Z IS ij: ?s s - s i:^ - 



265 270 



gac gca ggt agt act atg tct act qtc cac aaa ^^.^ *. 
ASP Ala Gly ser Thr Met Ser Thr Val h?^ ^T" f 



275 



Thr Val His Glu He Leu Cys Lys Leu 



etc 864 



260 



285 



age ttg gag ggt gtt cat tct aca ccc cca acrt nnn ^ 
ser Leu Clu Gly Val His Ser Thr Pro Pr^ Tr HI Sy III 

300 

<210> IB 
<211> 302 
<212> PRT 

<213> Artificial Sequence 
<220> 

<400> 18 

Met Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val 



906 



10 15 



Pro lie Leu 
15 

val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

2S 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

40 45 

cys Thr ^ Gly Lys Leu Pro Val Pro Trp Pro T^r Leu Val Thr Thr 

I^u cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

^5 80 
Arg His ASP Phe Phe Lys Ser Ala «et Pro Glu Gly Tyr Val Gin Glu 

" 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

125 

He ASP Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu tyr 

■^■^^ 140 

Asn Tyr Aan Ser His Asn Val Tyr He Met t 
145 ^ Gin Lys Asn 

155 

Oly lie Lys Val Asn Phe Lys Arg His Asn He Glu Asp Gly Ser 

170 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 



190 



Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 



200 



205 



31 



wo 00/50872 

PCT/USOO/04794 



Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val i^u Leu aiu Phe 

-^■^^ 220 
val Thr Ala Ala Oly lie Thr His Cly Met Asp Clu Leu Tyr Aan Ser 

235 240 
Gly Arg Arg Lys Arg Oln Lys Arg Ser Ala Gly Asp oi« Val Asp Ala 

250 255 
Oly ASP Olu val ASP Ala Gly Asp Olu Val Asp Ala Gly Asp Glu Val 

270 

Asp Ala Gly Ser Thr Met Ser Thr Val His Gin tio r ^ 

275 Cys Lys Leu 

285 

ser Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 



<210> 19 
<211> 906 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (885) 

<220> 



f S-^t^''™'^^""^'"^ sequence: Caspase 
8 -multiple VETD construct 

<400> 19 

«2 Si z IS i;: z 1^ ^Ai '^i s- - - 1- 'II - 

?S I- 22 2^ - s= - - - - . 

25 30 

s z i;: s?^ s ?s ^ I?; - «^ ^ - - - 
^ s ^: ?s s s ™ ?s ^ 

55 60 

5 S ^ IS - - IS S S K 

''^ 80 

s s Si j:: - IS - - r£ ^ s si - 
s ^ ?s s: - - - - ^ 2. 



240 



288 



32 



wo 00/50872 



PCT/USOO/04794 



?s K I" s ^ ;s s: - - 

att gac ttc aag gaa gat ggc aac att ctg gga cac aaa tto o»=. - 
lie JSP Phe .ys Olu Asp Oly Asn He Le^ I?y 

^ ^ L'? n° r ?r 

145 ncft ^ ® Gin Lys Aan 

160 

gga ate aaa gtg aac ttc aag acc cqc cac aar 

Oly lie .ys val Jsn Phe X.y^ Th. S Sa^ xi" II^ til 
1« 170 

?s LI z; 5-: - - - - 

185 2,90 

s - - - - ^ ^« 

200 205 

teg aaa gat ccc aac gaa aag aga gac cac atg gtc ctt ctt aaa ttt 
Ser Lys Asp P.o Asu Gl« Lys Arg Asp His Mef Val Le^ Le^ HI 

220 

?S S: S 1^ 1!^ S K IS - - ^ - 



384 



432 



480 



528 



624 



672 



235 240 

768 



gga aga agg aaa cga caa aag cqa tea ana rrnf 

^ ^ 4 111 Mi III s: s Sp s: 

'50 

S s: ^ s s ?s - - s 

265 270 
gac gca ggt agt act atg tct act gtc cac aaa 

ASP Ala Gly se. ^ Met Ser Thr ?S ^Xu xL^ S 2u 

280 285 

age ttg gag ggt gtt cat tct acacccccaa gtgccggatc c 
Ser Leu Glu Qly Val His Ser ay^tu c 



306 

290 295 



<210> 20 
<2ll> 295 
<212> PRT 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Caspase 
8 -multiple VETO construct 

<400> 20 

Met Ala ser Lys Gly Glu Glu Leu Phe Thr Qly Val Val Pro Xle Leu 



33 



wo 00/50872 

PCT/USOO/04794 
" 15 



val Gl« Leu ASP Gly Asp val Asn aiy His Lys Phe Ser Val ser Gly 

30 

Clu Gly Glu Gly Asp Ala Thr ^ Gly Lys Leu ^ Leu Lys Phe Xle 

Cys Thr Thr Gly Lys Leu P.o Val P«> 0^ p,o Leu Val ^ 

60 

■^u C>, ryr CI, va 01. cy. ^ ^ ^, 

''^ 80 

^ «. ^ Pje L,. s„ ,« ^ 

T,„ „. ^ ^ ... T„ ^ ^ 

110 

V.1 .y. <„. cxy «p ^ „^ ^ 

125 

Xle ASP Phe Lys Glu Asp Gly Aan lie Leu Gly His Lys Leu Glu Tyr 

-LJi 140 
Abu Tyr Asn Ser His Asn Val Tvr He ai, * 

145 ''^^ Ala Asp Lys Gin Lys Asn 

160 

Gly He Lys Val Asn Phe Lys Thr Ara Hio r^ 

165 ^ fin Asp Gly Ser 

170 j^^5 

val Gin Leu Ala Asp His Ty. Gin Gin Asn Thr Pro Xle Gly Asp Gly 

190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Gin Ser Ala Leu 

205 

Ser Lys Asp Pro Asn Glu Lys Arq Asn Hi« m^i- i . 

210 ^ Glu Phe 

^■^^ 220 

Val Thr Ala Ala Glv He Th-r w-tc m ^ 

235 

oay Arg Ars Lys Arg Gin Lys Arg Ser Ala Gly Val Glu ^r Asp Ala 

2^° 255 
aiy val gIu Xhr Asp Ala Gly Val Glu Thr Asp Ala Gly Val Glu Thr 

2^5 270 

Asp Ala Gly Ser Thr Met Ser Thr Val wl« ri„ n r 

275 Leu Cys Lys Leu 



280 285 



Ser Leu Glu Gly val His Ser 
290 295 



<210> 21 
<211> 4833 
<212> DNA 

<213> Artificial Sequence 



34 



wo 00/50872 

PCT/USOO/04794 

<220> 

<221> CDS 

<222> (1)..(4830) 

<220> 

^^^^^ Artificial sequence- 

EYPP-DEVD.MAP4-EBPP construcf 

<400> 21 



2? 2= 1?| - S '^^ - - - .JO 

S2 s s.^ IS til I s: - - i?^ 
IS m IS s ?s 51 s z^i s 
?s "I LI SI r s= - 

50 „ ''^^ ^''^ Trp Pro Thr Leu Val Thr Thr 

60 

'^^ 80 

cag cac gac ttc ttc aaq tec acn 

Gin Hl« Asp Phe Phe X.yt Hi Mel III T l^'' ''^^ 

^ «xa net Pro Glu Gly Tyr Val Gin Glu 

s ^ u': ^£ til s i:.= IS m 'Ii .-i s s is 

no 

?2 - - ^= s s SI - 1- z 

125 

s If; s 15 z ;s - - - si 

■^•^^ 140 

aac tac aac age cac aac qtc tat ^t. 

Asn lyr Asn Ser His Abb 111 Se Met f ^ f 

145 ^3^3 Met Ala Asp Lys Gin Lys Aan 

160 

SI IS IS. ts 'II s SI ;r 

165 ^ Asp 61y Ser 

175 

?2 SI SI S IS SI ^ S SI SI S IS .| IS - 
^?3S2I^-^«i-S?IS^SISSI2I 

205 



48 



96 



144 



192 



240 



286 



336 



384 



432 



480 



528 



576 



35 
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s s." ^ til s« ce, 

220 

I? ?s s K s - - ™ 

240 

IS o^: ?.i i:; - s IS 2: is 2: s s si - 

255 

s ^ s s s; «j - 2= «c „, 

260 otl ^® Met 

2" 270 

s s Su' ?s in ^ IS K ;s f ? s- 

275 280 ^ ""^ Val 

285 

SI ?S IS 5S K 52 2: 21 St St St r r 

290 ^-^y Asp Glu Lys Thr 01 v 

300 

aac tea gag tec aaa aag aaa ccc tqc tta cj^n 

Asn Ser Glu Ser Lys Lys Lys Pro Ss 2u fo« o^"" 

305 310 ^ Val Glu 

S S 12 1 2,1 52 ?£ 2: 2: 2S - SI i - 

330 335 

SI S «1 L'l ?£ 12 ^ 2^ 2: S 2= 

340 ^ ,!f ° Leu Glu Glu. 

350 

^' S! Sp 5S ^ SI S° SI 

355 ^ ''^'^ ^^"^ *sn Trp Pro Glu Asp 

365 

IS ^ J2 52 X 2^ IS 2! 2: Z ?2 12 IS L2 

380 

gag ccc ttt aac gag cac cat aat aaf 

Olu Pro Phe Asn Glu His £g ^ I?^ ctg etc ttt 

385 ^ P Asp Leu Leu Phe 

■^^^ 400 

?S 52 52 ^ S 25 L2 52 S 52 25 SI S 2,= 25 

415 

12 12 Z 52 52 oT. 2? 25 25 S5 25 52 25 K 52 

430 

S Sa' III gga gcc cca aac tct cca 

435 ^ Ala Asn Ser Pro 

445 



1152 



1200 



1248 



1296 
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S.' |2 IS - - ^ m z s i;: ^ - 

460 

s: s s s s: s S! r r ™ 

465 ^-^^ Val Glu Ser Val Lys Qlu 

^'^^ 480 

s s s s: s s s 

490 

9tg gtg cac tct cca tec aea na^ 

val val His ser Pro ?S Sp ^ ^ ^ ^^'^ 

500 ^ c«= Thr Glu 

SIO 

s s s s 5: 5: ij; s ?l= si ?s ?.i 

525 

s S S - ?| s - s - - - •*3. 

540 

s s s if- IS s s: l- l-; 

" 560 

SI IS ^ 1^ IS s - - ?s s ^ e is 

^''^ 575 

s Me? ii: ?s r 

580 Asn Asp Thr Thr Ser Pro 

590 

Ss Thr ^1^ ?S 2: r gac ttg gca cca cct 

595 ASP Leu Ala Pro Pro 

605 

Ol^ K ?2 III ^ -a c.a ecc cca gcc aa. ^.c 

610 " P'^o Ala I-ye Gly 

620 

m2 ^11 S 2u L^r ^1^ S: -3 3" aag aat gat gtt 

625 630 A«P val 

s LI IS IS ?s IS rj IS SI IS ?s - - is s; 

"° 655 

IS 1.1 SI laS s IS s r r «=• 

660 ^^"^ *«P He Thr 

-si|siiS.-f?ss:s^s;isLsis^issi 

685 



1728 



1776 



1968 



2016 
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Val ASP Met ^ P.O s X.U Glu Xhr Glu Met Thr .eu Gly z.ys 

700 

''^^ 720 

2^ t^. I S It: 2S ^. - ;2 - - 

s: s sf - - 5s z z ;s ?s l'J s 
I" IS s s - sr z LI 2^ m ;s 



'6° 765 



IS s s s 5^ s - - If; S s - 



2448 



2496 



"I ?:! L"p Z tli L" L" r r= 

785 ,on I-eu Pro Leu 

'95 

gaa aca aaa gta gca aca att cca at-^ 

eiu Thr Lya Val Ala Thr Val f ^^'^ S^ra act gta cag 

805 Thr val Gin 

815 

?s IS ^ s s iL- s SI s s s.= 

830 

^ m s K ?s - - ?s s 2; IS IS ?s 

845 

Sv.' s s s- IS? s j:: s s z is s is 

860 

2; SI L-^ SI SI IS SI m IS IS s'S IS 

875 

- 3." ^ |; - - - - s "! s - - - 

895 

s??£i!^sss;s?ssiisis^.-isiss:s 

910 

'dl IS IS S - S 21 S ^ S IS IS S ^ 

925 

act tea aca teg aaa gcc aaa aca caq ccc arh 

Se. Ti^ ser z,ys Ma Lys Thr III S s'e" Leu ^ro Ly^ 



2688 



2736 
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930 Q3C 
cca get ccc acc acc tct ggt ggg tta aat 

^^^.^r^r^ . - ig L'; i^. s2 is 

£! s IS IS ;s s K S S 

970 

990 



i?s Tri Te ii: s III ii: r ^'^•^ 

995 Thr Ser Pro Ser 



.^^s^ s IS ir| 2: IS HI '^i IS 

1020 



acc gtt tea aaa gcc aca tct ccc tea ar^ r^t-t- *. 

«i s,, .y. ^ - s s: £ I- =s 



3120 



1055 

3216 
3264 
3312 



s s s IS IS s s ?s 2J - - ^ - 

1050 3L055 

t^s III z s r ^^^^ ^'^^ ^- 
s i:- 2= IS IS - - - - . 

1075 inoA Ser Val 

5f L^n ?^r ?S ?2 I?? S: r r 9- -999 atg act 
1090 Jgf ^ Met Thr 

1100 

Se'r ^ tS. III s^r 111 T T '^^'^ ''^^ '''^ 3'=^ 

1105 iSo *^ ^^"^ Ala 

Ills 1120 

'^l £! fr.£. 1^' IS IS IS IS IS IS s 

" 1"0 1135 

IS S 52 IS S S IS S IS S IS S= IS 

^^^^ 1150 

J IS rj s^is IS SI - s: i^ si is si - 

1160 J3^g5 

?SSI!^SESI_^IIS.^|^^-SISSI?S - 



3360 



3408 
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I" s z til s z s: tn s 

1200 

S S S: S J ^=1 - - Z 3«. 

1210 1215 

^ IS ^ '£ SI ill cT. S - - - - - 

1220 looc ^ Asn 

1230 

•S J - - - - r Z 

^^40 1245 

s: 1- s - m k k si s 

-^-^^^ 1260 

^ys etc ^ 

1265 ^1270 ^ ^^"^ Leu 

^-^''^ 1280 

11^ IS I!; J SI S ?3 I- - - 

1290 1295 

S S S ?2 ?S - - - - 

1305 131Q 

IS SI J IS I!; SI ^ SI 15 IS I- - - 

1320 1325 

SIJS SI Sf III SI J SI SI III SI III SI SI SI in 

1340 

S« fly ?S 2^ 3^^; If 9- caa agg gag ccc cag 40B0 

- ^.1 III IS jS SI IS SS S SI K S? ?|? SI III SI 

1370 1375 

IS 22JII m - ;i? ?i! ii: «: - ?ii is si si si 

1385 1335 
gac gta aac gge cac aag ttc age gtq tee aao ™ 

'-i' s -° ^ IS SI SI SI SI 

1405 

Sl£ ill oT, III III S SI III III S - s ?s SI XI 



3696 



3744 



3792 



3640 



3688 



3936 



4126 



4176 



4224 
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BP - S J Z S: 55 z - s s 

1440 

3: s I- J ^ s IS s s SI St - s: 

1455 

s ir: „%^? s: o^^y r r "c ttc 

1460 ^ i4« *^ ^ Phe Phe 

" 1470 

^n; m z ^ - - s!^ - - - 

S S IS S Sf S2 S s"^' «^ |;e 

1500 

gac ggc aac ate ctg qqq cac aarr r,*-^ 

Asp Gly Asn He Leu ofy His L^s 2u ^ac 
1505 ^lll Qlu Tyr Asn Phe Asn Ser His 

'^^-^^ 1520 

s« s J s: s "f jf SI ifj - «c 

1535 

J SI L-i is j ^ si z s: is 

^^^^ 1550 

s: js Ls ?s ^: j: g; - .3 ^i s s 

1565 

ti: III til 111 r r 

1570 Ser Ala Leu Ser Lye Asp Pro Asn 

"'^ 1580 

£51 S Sl|? - - IS JJI ?2 SI - SI ^ 

^^^^ 1600 

S S 21 ««f IS SI S - - 



4320 



4368 



4416 



4464 



4512 



4560 



460e 



4656 



4704 



4752 



4633 



<210> 22 
<211> 1610 
<212> PRT 

<213> Artificial Sequence 
<220> 

^"""^ ^''^P*^^^^ °^ Artificial Sequence. 
EYPP.DEVD-MAP4-BBFP construct 

<400> 22 

Met Val ser Lys CZy alu oXu X.u P.e Thr Gly Val Val Pro Xle l.u 

15 
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Val Glu X.U ^ aiy ^ val Asn O.y His Lys Phe Ser Val Ser Gly 

30 

01« CI, „„ ^ ^ ^„ ^ ^^^^ ^ 

60 

Phe Gly tyr Gly Leu Gin Cya phe Ala Am tw « . 
65 ^ Ala Arg Tyr Pro Asp His Met Lys 

SO 

Gin His Asp PKe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

90 

lU PJ. P.. ^ Ty. ^ »^ Z Olu 

•^^^ 110 
val Lys Phe Glu Oly Asp Thr I.u val Asn Arg xie Glu Leu Lys Gly 

125 



XXe ASP Phe Lys Glu Asp Gly ^„ 

■^•^^ 140 



Aen Tyx Asn Ser His Asn Val Tvr Ii*:. m«»- i^i . 

145 3^50 ""^^ ^ ^-^^ Met Ala Aap Lys Gin Lys Asn 

160 

Oly lie Lys val Asn Phe Lys Xle Arg His Asn Xle Glu Asp Gly Ser 



170 i„ 



Val Gin Leu Ala Asp His Tvr Gin m.. » 

180 ^ lie Qly Asp Gly 

190 



Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser ryr Gin Ser Ala Leu 

205 

Ser Lye Asp Pro Asn Glu Lys Arq Asd Hi« m«^. ,r i r 

210 ^ "et Val Leu Leu Glu Phe 

^-^^ 220 

val «.r Ala Ala Gly xle Thr Leu Gly Met Asp Glu Leu Tyr Lys Lys 

240 

Gly Asp Glu Val Asp Glv Met Ala ao« t 

^ P net Ala Asp Leu Ser Leu Val Asp Ala Leu 



240 

Gly Met Ala Asn Lph a*,- 

245 

255 

Olu: Glu Pro Pro Pro Glu Xle Glu Gly Glu Xle Lys Arg Asp Phe Met 

270 



Ala Ala Leu Glu Ala Glu Pro ^ Asp Asp xle Val Gly Glu Thr Val 

285 

Glu Lys T^ Glu Phe Xle Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly 

300 

Asn ser Glu Ser Lys Lys Lys Pro Cys Leu Asp Thr Ser Gin Val Glu 

320 

Gly Xle Pro Ser Ser Lys Pro Thr Leu Leu Ala Asn Gly Asp His Gly 



"0 335 
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alu J.J ^, ^ ^. ^ ^ 



"5 350 



--9 V.1 ^ p„ ^ ^ ^ 

365 

Ala Ser Phe Cys Phe Gin Pro Gin Gin Val ^ 

370 ^-^^ ^^-^ Asp Thr Asp Gin Ala 

^'^ 380 

Glu Pro Phe Olu ^3 Arg Asp Asp cly .eu Ala Asp X.u Leu P.e 

•^^^ 400 
V.I ... Se. CI, ^ 3„ ^ ex. ^ ^. 

415 

«» Olu s„ ryr 01, M„ ^„ cy. «p s.. P,„ „. 

Thr Ala Val Val Ser Qln Qiu Trp Ser Val Gl^ ai » 

435 °-'^y Ala Pro Asn Ser Pro 

445 

.er Cl„ s„ c,. V.1 p„ ^ ^ 

460 

Pro Ala Thr Olu .eu Ser .ys Ala Ala Glu Val Olu Ser Val .ys Olu 

^''^ 480 
Cln X.eu Pro Ala .ys Ala .eu Cl« Thr Met Ala Olu Oln Thr Thr Asp 

val val His ser P.o Ser Thr Asp Thr Thr P„ oly P.o Asp Z Olu 

510 

Ma Ala Leu Ala Lys Asp xie Olu Olu Xle Thr X.ys Pro Asp Val xie 

^"^^ 525 
X-u Ala Asn Val ^ oin Pro Ser Thr Olu Ser Asp Met Phe X.u Ala 

540 

Oln ASP Met Olu Leu Leu Thr Oly Thr Olu Ala Ala His Ala Asn Asn 

560 

n. II. ^ P^o Tb, Olu P„ ^ 

575 

Pro Pro Met Olu olu Olu He Val Pro Pi« , 

580 lit ^ ^ Ser Pro 

5«5 590 

-y =lu ^ Olu T,„: ™, 

605 

Clu ASP val Leu Leu Lys Olu ^ oiu Leu Ma Pro Ala Lys Cly 

Met Val ser Leu Ser Olu Xle Olu Olu Ala Leu Ala Lys Asn Asp Val 

640 

Arg Ser Ala Glu lie Pro Val Ala r^r. ^ 

645 ^ Ser Glu Thr 

655 

Olu val val Leu Ala Thr Olu Val Val Leu Pro Ser Asp Pro Xle 
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^ Thx .ys ASP val Thr Pro Leu Olu Ala Olu j Pro Le. 

685 

..t P„ . ^ ^ 

700 

=1U p„ p„ ^ ^ 

7X5 

Ser Pro r.u Pro Glu Ser olu Val Thr Leu Gly Lys Asp val Val lie 

Leu Pro Glu Thr Lys Val Ala n.. ni,-. » 

YB vai Ala Glu Phe Asn Asa Val Thr Pro Leu Ser 

Olu Glu Glu val ser Val Lys Asp „et Ser Pro Ser Ala Glu Thr 

760 

Olu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser Gly Thr Glu Leu 

780 

ne val ASP Asn Ser Met Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu 

'^^^ 800 
Olu Thr Lys Val Ala Thr Val Pro Xle Lys Asp Lys Gly Thr Val Gin 

^ Olu Glu Lys Pro Arg Glu Asp Ser Gin Leu Ala Ser Met Gin His 

830 

Lys Gly Gin ser Thr Val Pro Pro Cys Ala Ser Pro Glu Pro Val 

845 

I-ya Ala Ala Glu Gin Met Ser Thr Leu Pro He Asp Ala Pro Ser Pro 

860 

jeu Glu Asn Leu Glu Gin Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu 

880 

Pro cya Ser Gly Val Ser Arg Gin Glu Glu Ala Lys Ala Ala Val Gly 

895 

val Thr Gly Asn Asp Xle Thr Pro Pro Asn Lys Glu Pro Pro Pro 

910 

ser Pro Glu Lys Lys Ala Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys 

925 

Thr ser Thr Ser Lys Ala Lys «u: Gin Pro Ser Leu Pro Lys Gin 

^■^^ 940 
Pro Ala Pro Thr Thr Ser Gly Gly Leu Asn Lys Lys Pro Met Ser Leu 

960 

Ma ser Gly Ser Val Pro Ala Ala Pro His Lys Arg Pro Ala Ala Ala 

970 375 

Thr Ala Thr Ala Ar. Pro Ser ^ .eu Pro Ala Ar, Asp Val Lys Pro 
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III ^ - - 

1005 

''■..^ =~ '"^JJl ^ Cl, ^ 

■^"■^^ 1020 

ms'-' ^' "» L.„ V,l S„ T.„ p„ 

^^^^ 1040 

"'x^ ^x. ^ P„ ^ 

-^"^^ 1055 

"".SJ ^y. H.t ^ „. 

^^^^ 1070 
Ala ser Ma ser ser Lya ^ ser Ma ser Ser Val 

^"^^ 1085 

Lye Arg Asn Thr Thr Pro Thr Gly Ala Ala Pr^ o 

1090 1095 Ala Pi^o Pro Ala Gly Met Thr 

" 1100 

Ss"" »^ V.1 ^„ ^ 

^•^^^ 1120 

S„ „.a ;.p^j^,. 

1"° 1135 
Pro Arg Val Ser Arg Leu Ala Thr Thr Val Ser n 

1140 ^ ^° Asp I-eu Lys 

1150 

ser Val^A^ ser Lys Val Gly Ser Thr Glu Asn lie Lys His Gin Pro 

1165 

Oly^Gly Gly Arg Ma Lys Val Glu Lys Lys Thr Glu Ma Ma Thr Thr 

^^^^ 1180 
^a^Gly Lys Pro Glu Pro Asn Ma Val Th, Lys Ma Ma Gly Ser lie 

1200 

Ala ser Ma Gln^Lys Pro Pro Ma Gly Lys Val oin lie Val Ser Lys 

^^^^ 1215 

Lys val ser Tyr Ser His He Gin Ser Lys Cys Val 9.. r . 

1220 T,-- ^ ^^-^ Ser Lys Asp Asn 

•^^^^ 1230 

a. v.a ,„ ^ ^„ 

1245 

Val^Asp lie ser Lys Val Ser Ser Lys cys Gly Ser Lys Ma Asn lie 

^'^^^ 1260 
Jya^Hls Lys Pro Gly^Gly Gly Asp Val Lys lie Glu Ser Gin Lys Leu 

"'5 1280 
Asn Phe Lys Glu Lys Ma Gin Ala Lys Val Glv s«r r . 

1285 ,,o„ ^ Asn Val 

1295 

.ly His Phe^Pro Ma Gly Gly Ma Val Lys ^ Glu Gly Gly Gly Ser 



1310 



45 
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^ ^.^^ ^Cy,^ „^ ^ 

1325 

- «. ^ „^ ^ 

•'■'^ 1340 

- oiy «. ^ ^ 3„ „^ ^ ^ 
^ L~ ^ s„ ojj n, ci. ^ 

1375 

Glu Glu Leu Phe Thr Gly Val Val Pr« ti , 

1380 ^ ""^^^f^ Val Glu Leu Asp Gly 

1390 

v..^^ ex, „. ^ 

1405 

"a^™, 0., .y. ^^j^^ ^„ ^ ^ ^ 

- V.X ^ ..^ ^ ^ 

■*-^35 

1440 



Cy. P.e Se.^j., ^ 

1455 

Lys Ser Ala Met Pro Glu Gly Tyr Val Gin m . 

1460 ^ ° "e Phe Phe 

1470 

""."f - - V,. 

" 1485 

•-x^ ^n. .1. ^. ^ 

1500 

JJP^=iy ^. „. „. ^„ ^ 

•^^^^ 1520 

° - ^ .r. o.. 

"35 

"* •» -J s„ val 01. ^ „. 

^^^^ 1550 
- - - - P» ne 01, ^ ^„ 

"P^..; .1. :^ ^ ^ ^ 

S;='" cau ,.."1° ^ 

He Thr Leu Gly Met Asp Glu Leu Tyr Lys 



1610 



<210> 23 
<211> 978 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(978) 



96 



<223> Description of Artificial Sequence- 

GFP.nucleolus-Caspase 8 -anneSrxi- construct 
<400> 23 

^ s IS ^. m I- - - s m ti s: 

15 

gtt gaa tta gat oat aa^ ^f^^ =.^« 

vu K ;s 2: Si '^t - - - 
Su' SI - - - - Z 

35 ^ Thr Leu Lys Phe He 

45 

Ss^ tS ?2 III s s,^ r T '''^ 

50 ' t° Trp Pro Thr Leu Val Thr Thr 

60 

S ^= S c" s jj: s r - 

70 '^''^ Arg Tyr Pro Asp His Met Lys 

80 

egg cat gac ttt ttc aag agt qcc at« 

«. MP Pj. .,1 ,S - =^ i;; jc. 

110 

"I lii IS IS ?s z z - ;r r 

115 ,^): ^ Glu Leu Lys Gly 

125 

S I?y aaa ttg gaa tac 

130 ^ ^eu Gly His Lys Leu Glu Tyr 

140 

A^n ^ ser Si ife f ''''^ 

145 ^ lie Met Ala Asp Lys Gin Lys Asn 

160 

2S S K V-! Z V. S til IK - - - 

■"•^^ 175 

gtt caa eta gca aac cat- 

v.. Lp s: oS z ^ pS s: - - - 

190 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 
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195 



200 



205 



s IS til I" SI s= «? f? «t 

210 ,1^ Wet Val Leu Leu Glu Phe 

220 

?s ^ m 111 m ^ ?s - - - 2 i^ce 

2" 240 

255 

aga agt ggt ttt gag atg tct caa 

Arg ser Oly Phe II^ „e? sir S Pro lie So Se'r f" '^^^ 
260 ^ His Leu Thr Arg 

270 

si? s i?j iL^ LI ii^: s ?s ^^'^ ^tt a.4 

2,5 iir ASP Ala Gly Val Glu Thr Asp Ala Gly Val 

285 

?s s: s: z; r r 

290 Gly Ser Thr Met Ser Thr 

300 

s si is s is ir, ?s s.' 2^ 

320 

ccc cca agt gcc gga tec 

Pro Pro Ser Ala Gly ser 978 
325 

<210> 24 
<211> 326 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

GFP-nucleolus-CaBpase B-anneSu ii construct 
<400> 24 

Met Ala ser .ys Gly Glu Glu X.eu Phe Xhr Gly Val Val Pro Xle I.u 

10 ^5 



V.1 «U ^ «p o., ^ ^ 

30 

01, „. ™, ^ ^„ ^ 

cy Thr ^ p P„ Trp Z ,.1 

60 

Tyr 01, v.! C ^ ^ 

''^ 80 
^ His ASP Phe Phe Lys Ser Ala Met Pro Glu Gly ryr Val Gin Glu 
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Arg ne PJe Phe .y. ^ ^ ^ 

-^^^ 110 
Val X.ya Phe OI« oXy ^ V.l Asn xie Clu .eu X.ys aiy 



"0 125 



lie Phe Lys Glu Asp Oly Asn Xle Leu oiy His .ys Leu Clu Tyr 

140 



Asn Tyr Asn Ser His Asn Val Tno: 11^* m*.*- . 

145 i5(j ^ Wet Ala Asp Lys Gin Lys Asn 

■^^^ 160 
Oly lie Lys val jsn Phe Lys Thr Arg Hie Asn He Glu Asp Oly Ser 

170 

val cm Leu Ma Asp His lyr Gin Gin Asn Thr Pro He Gly Asp Cly 

190 

pro val Leu Leu Pro Asp Asn His ryr Leu Ser Thr Gin Ser Ala Leu 

205 

ser Lys Asp Pro Asn clu Lys Arg Asp His Met Val Leu Leu Glu Phe 

^-^^ 220 
val Thr Ala Ala Gly lie ^r His Gly Met Asp clu Leu Tyr Asn Ser 

240 

Cly Arg Lys Arg lie Arg Thr T^r Leu Lys Ser Cys Arg Arg Met Lys 

255 

^ s.. aiy P.= ai„ «t s„ ^ ^ ^ ^ 

270 

ser Ala Cly Val Clu ^ Asp ^a Cly Val Glu Thr Asp Ala Cly Val 

280 285 

Glu Xhr ASP Ala Gly Val Glu Thr Asp Ala Cly Ser Thr Met Ser Thr 

300 

val His Clu lie Leu Oys Lys Leu Ser Leu clu Cly Val His Ser Thr 



315 

320 



Pro Pro Ser Ala Gly ser 
325 



<210> 25 
<211> 948 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(948) 



<220> 

<223> ^cription of Artificial Sequence: 

GFP-nucleolus-Caspase 3-anneS^n li construct 
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<400> 25 

!!! f.^^ 9aa etc ttc act oa. . 

48 



^ s s ^ - - s s: 15 5« „. 

15 

9tt gaa tta gat aat aa^ 

V" ^ o!, - ^^ St ^. - - - 

" 30 

I" IfJ LI IS ^ g- - 2' r - «c 

35 ^ Thr Leu Lys Phe lie 

45 

SI s s s ?| s 1^- s - - 

60 

ctg tgc tat ggt gtt caa i-*-*. 

- - - - f SI ij: s - - IS St s: 

80 

egg cat gac ttt ttc aaa aot 

^ ^ - ,„ ^ ,„ 

95 

acc ate ttc ttc aaa 

Arg Thr lie Phe Phe' I^J f?^ aca cgt get gaa 

100 ^ Tyr hys Thr Arg Ala Glu 

110 

S 2" i?J K S ?S - ;s =s iL' J- 1?^ 

" 125 

- g S 2S. K S SI - s IS - 

140 

As'n ?S ^^3^ ate , 

"5 150 ^ "^'^ *=P Oln Lya Asn 

160 

SI S! SI 5£i s s si Li si r s' 

165 ^ Qiu Asp Gly Ser 

175 

gtt caa eta gca aac rah « 

v.. L. SI ^ SI 1^ L-i s s: - - 

190 

SI S SI SI IS - Si SI L- III ?s SI 12 SI 21 

205 

='2 i HI in L- g| ^ - CI SI LI 21 in S? - 

^ 220 

g S SI IS 15 SI ?S SI LI L'l oT - 

Met Asp Glu Leu Tyr Asn Ser 



96 



144 



192 



240 



288 



528 



576 



624 



672 
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s m IS s s s ?s s= 2: s 

270 

teg tat gaa aaa gga ata cca qtt oaa ara c=.^ ^ 

Ty, 0.U cl. II. p„ S L^p I" IS til 

285 

?S s| ^? s ?s s I?: - - - - - 

300 

gag ggt gtt cat tct aca ccc cca agt acc aaa fcr- 
Glu Gly val His Ser Thr Pro Pro s£ II'^ I?y 



315 



<210> 26 
<211> 316 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence- 

GFP-nucleolus-Caspase 3-annexin ll construct 
<400> 26 

«et Ala ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 



10 15 



val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

25 30 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Ly, Leu Thr Leu Lys Phe He 

45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

60 

I-u cys ryr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



80 



^ His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 



105 



110 



val Lys Phe Glu Gly Asp Leu Val Asn Arg He Glu Leu Lys Gly 

•^20 125 

ne ASP Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 



155 



160 
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Gly He Lys Val Asn Phe Lys Thr Arg His Asn ll. 

165 ^ Asp Gly Ser 

175 



vu 01. ^ „. ^ ^„ ^ ^ 

''^ 190 
V.1 ^. ^ ^„ ^ se. ^„ 

s» Ly. ,„ ^ ^ 2 ^ 

220 

M. ^ „^ ^ ^ ^ 

240 

Gly Arg Lys Arg He Arg Thr Tyr Leu Lys Ser r^. » » 

24S ^^'^ Arg Arg Met Lys 

255 

Arg Ser Gly Phe Glu Met Ser Arq Pro ti» b » 

260 ^ Se^^ His Leu Thr Arg 

^" 270 
s« ly. =lu .y. oiy n. v« a,„ ^ 

285 

Tyr Ser Thr Met Ser Thr Val m e n^ -r. 

290 <^1^ lie Leu Cys Lys Leu Ser Leu 

^ 300 

Glu Gly val His Ser Thr Pro Pro Ser Ala Gly Ser 



<21D> 27 
<211> 2088 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(1041) 

<220> 



S^^SS^T °' sequence: 
NLS-Pred25-synaptobrevin construct 

<400> 27 

^1 s s .™ s s: - - - 1- «c ^ 

s s ^ «' 2: - 

30 

^ys III s£ I?y Ii^^ £^ gca aca tac gga aaa 

35 ^ Asp Ala Thr Tyr Gly Lys 

" 45 

^eu tS III iTe Ss S Th' 9^*= tgg 

50 ^5 ^ Lys Leu Pro Val Pro oip 



48 



96 



144 



192 



52 
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i* ?s z £ s s - - s.' S S.' 

" 80 
tac ccg gat cat atg aaa egg cat qac tti- n-r, 
ryr P.O ^ Hi, - ^ „c a., ,g ^„ 

s: SI s s: js: - r r - ^' - 

100 P*'^ Abp Asp Gly Asa 

" 110 

?s s ii" s- n: r ^ - .„ 

115 i*^^ Asp Thr Leu Val Asn 

^'^^ 125 

aga ate gag tta aaa qqt att 

c% 1^ - - - - 2; 

140 

i .™ s ^ s= ~; „e 

155 _ 

160 

S S ^. Zl - K J^-; «o ^ 

1^*^ 175 

s iL- s s s s: 2; s; s: - s: si z 

190 

S n" §!y i:p' Ill '^'^ ^^'^ "9 

195 ^ J^J '^'^ Aap Asn His Tyr Leu 

205 

III o!: s« I?: i:; r r - 

210 P^^O Aan Glu Lys Arg Asp His 

" 220 

j| 5S 2: 1?? f.i s - 15 s ?s 
^ s" s ?s L's ?s s:? ?s ir r - 

y Met Ser Thr Gly Pro Thr Ala Ala Thr 

255 

ggc agt aat cga aaa ctt rarr 

□ly ser Asn A?g A^ ^ ^ "J - g , 

260 -.r: Asn Gin Val Asp Glu Val 

270 

gtg gac ata atg cga att aac oi-rr 

vu ^ ^ ^ £ - ^ ^ . 

285 

2u S cag gca ggc get . 

290 295 ^ ^^'^ Ala Gly Ala 

300 

tct caa ttt gaa acg age gca acc aaa . 

y y g a gcc aag ttg aag agg aaa tat tgg tgg 9, 



80 

288 
336 
384 
432 



480 



528 



576 



672 



720 



53 
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Ser Gin Phe 01« ser Ala Ala Lya l.u Lys Arg .y. ^ ^ ^ 

320 



^. t^'^v. j:? s: s ?g s v^' s ?.i IS - - 

330 

ri: ii: s: ii: s: ^? s\^^ ^^"^^^^^^ ^^^^-^-^ 

340 

aaaggctagc aaaggagaag aactcttcac tggagttgtc ccaattcttg ttgaattaga 1121 
tggtgatgtt aacggccaca agttctctgt cagtggagag ggtgaaggtg atgcaacata liai 
cggaaaactt accctgaagt tcatctgcac tactggcaaa ctgcctgttc catggccaac 1.41 
actagtcact actctgtgct atggtgttca atgcttttca agatacccgg atcatatgaa 1301 
acggcatgac tttttcaaga gtgccatgcc cgaaggttat gtacaggaaa ggaccatctt 13^1 
cttcaaagat gacggcaact acaagacacg tgctgaagtc aagtttgaag gtgataccct 1421 
tgttaacaga atcgagttaa aaggtattga cttcaaggaa gatggcaaca ttctgggaca 1481 
caaattggaa tacaactata actcacacaa tgtatacatc atggcagaca aacaaaagaa 1541 
tggaatcaaa gtgaacttca agacccgcca caacattgaa gatggaagcg ttcaactagc 1601 
agaccattat caacaaaata ctccaattgg cgatggccct gtccttttac cagacaacca 1661 
ttacctgtcc acacaatctg ccctttcgaa agatcccaac gaaaagagag accacatggt 1721 
ccttcttgag tttgtaacag ctgctgggat tacacatggc atggatgaac tgtacaacac 17B1 
cggtatgtct acaggtccaa ctgctgccac tggcagtaat cgaagacttc agcagacaca 1841 
aaatcaagta gatgaggtgg tggacataat gcgagttaac gtggacaagg ttctggaaag 1901 
agaccagaag ctctctgagt tagacgaccg tgcagacgca ctgcaggcag gcgcttctca 1961 
atttgaaacg agcgcagcca agttgaagag gaaatattgg tggaagaatt gcaagatgtg 2021 
ggcaatcggg attactgttc tggttatctt catcatcatc atcatcgtgt gggttgtctc 2081 
ttcatga 



2088 



<210> 28 
<211> 347 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence- 
NLS-Pred25-synaptobrevin construct 
<400> 28 

Met Arg Arg Lys Arg Oln Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 
5 10 15 



54 
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Oly val val Pxo Xle x.u Val Olu I.u Asp oiy val Oly Hie 

I-ys Phe Ser Val Se. Gly Glu Gly Glu Gly Asp Ma Thr j Gly Lys 

«0 45 
Leu 0^ I.U x.ys p.e He Cys Thr Thr Gly .ys x.u Pro Val Pro Trp 

55 60 ^ 

pro Thr Leu Val Thr Thr X.eu Cys Tyr Gly Val Gin Cys Phe Ser ^ 

'^^ 80 
ryr Pro ASP His Met Lys Arg His Asp Phe Phe Vys Ser Ala Met Pro 

«. oly ^ v,i .1. oxu „ ^ 

•^"^ 110 
xyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

"° 125 
Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu 

140 

Oly His Lys Leu Glu ^ Asn Tyr Aan Ser His Asn Val ryr He Met 

Ala ASP Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys T^r Arg His 

175 

Asn He Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 

190 

Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu 

205 

ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

^■^^ 220 
Met Val Leu Leu Glu Phe Val Thr- 

225 230 ^ ^^"^ Met 

240 

A.P Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Pro Thr Ala Ala Thr 

" 2S0 255 

Oly ser Asn Arg Arg Leu Gin Gin Thr Gin Asn Gin Val Asp Glu Val 

2" 270 
val ASP He Met Arg Val Asn Val Asp Lys Val Leu Glu Arg Asp Gin 

285 

r^ys Leu ser Glu Leu Asp Asp Arg Ala Asp Ala Leu Gin Ala Gly Ala 

Ser Gin Phe Glu Thr Ser Ala Ala tv*, t«, t 

305 310 ^® ^^"^ Arg Lys Tyr Trp Trp 

320 

Lys Asn Cya Lys Met Trp Ala He Gly He Thr Val Leu Val He Phe 

335 

He He He He He Val Trp Val Val Ser Ser 



55 
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340 



<210> 29 
<211> 2106 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (1050) 

<220> 

<223> Description of Artificial Sequence- 
NLS-Fred25-cellubrevin construct 



<400> 29 

tga aaa cga caa aaa act aar^ „ . 

48 



Mel S S Gin Itl T T ^ "«= 

^ 3 g iys Arg Gin Lys Ala Ser Lye Gly Glu Glu Leu Phe Thr 



10 15 



s- ?s s s- - si l- i?; s.= 

2^ 30 

i?s '^i s If- S3 s: K - s s 

60 

2; SI S g= - - - ™ - J5 

80 

S SI j;: - s: - - ij: - - - 

85 90 35 

S SI S S2 li: SI ?II HI - - - SI III 

105 ij^Q 

SI ^ S III SI SI I^ IS SI - - - - SI 

125 

S «I S3 III SI 1 III HI ^ SI L'l S^ in 51 - 
S SI III 12 SI IS HI IS s° I- 5s f • "c «c .t, 

145 Val Tyr lie Met 

155 

gca gac aaa caa aag aat qqa ate aaa fH-« 

^ .y, ^ III III s? in in ^ :s SI 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 

• w XJ.C jjys vaX T,._ f«ii , — . 



56 
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^ «" K IZ S X 5 '.Z 

•^^^ 190 

^ ne s Sj - K i:: s - 

200 205 

K s LI it: ^ s il- :s ^ - - 

^■^5 220 

g z ?s ?s s s m s ™ 
s: ?s IS ^? - s 15 ?s IS - «s 

255 

5S s s: s s= s 5 1 S.' ?s 

270 

S in s ?s s s ?ir L-: s ^ fj - 

280 285 

L- SI 1 IS 2: s s s - s: - 

300 

i S IS - S2 5S I" ?S S IS LI S 

^ S S g: - - s 15 L- S 

330 

SI jf: ?s ?s -11 s K ?3 s SI ti s 

345 

taaatgagaa gaaaacgaca aaaggctagc aaaggagaag aactcttcac tggagttgtc alio 
ccaattcttg ttgaattaga tggtgatgtt aacggccaca agttctctgt cagtggagag ixvo 
g^tgaaggtg atgcaacata cggaaaactt accctgaagt tcatctgcac tactggcaaa X230 
ctgcctgttc catggccaac actagtcact actctgtgct atggtgttca atgcttttca 
asatacccgg atcatatgaa acggcatgac tttttcaaga gtgccatgcc cgaaggttat X350 
Stacaggaaa ggaccatctt cttcaaagat gacggcaact acaagacacg tgctgaagtc 1410 
aagtttgaag gtgataccct tgttaataga atcgagttaa aaggtattga cttcaaggaa 1470 
gatggcaaca ttctgggaca caaattggaa tacaactata actcacacaa tgtatacatc 1530 
atggcagaca aacaaaagaa tggaatcaaa gtgaacttca agacccgcca caacattgaa 1590 



768 



816 



864 



912 



960 



1008 
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gatggaagcg ttcaactagc agaccattat caacaaaata ctccaattgg cgatggccct 1650 
gtccttttac cagacaacca ttacctgtcc acacaatctg ccctttcgaa agatcccaac 1710 
gaaaagagag accacatggt ccttcttgag tttgtaacag ctgctgggat tacacatggc 1770 
atggatgaac tgtacaacac cggtatgtct acaggtgtgc cttcggggtc aagtgctgcc 1830 
actggcagta atcgaagact ccagcagaca caaaatcaag tagatgaggt ggttgacatc 1890 
atgagagtca atgtggataa ggtgttagaa agagaccaga agctctcgga gctagatgac 1950 
cgcgcagatg cactgcaggc aggtgcctcg cagtttgaaa caagtgctgc caagttgaag 2010 
agaaagtatt ggtggaagaa ctgcaagatg tgggcgatag ggatcagtgt cctggtgatc 2070 
attgtcatca tcatcatcgt gtggtgtgtc tcttaa 

2106 

<210> 30 
<211> 350 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence- 
NI.S-Fred25-cellubrevin construct 

<400> 30 

Met Arg Arg Lys Arg Gin Lys Ala Ser Lys Gly Glu Glu Leu Phe Thr 

5 10 15 

Gly Val val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His 

25 30 

Lys Phe ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys 

40 , 45 

Leu 0^ Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp 

Pro Thr Leu Val Thr Thr Leu Cys aVr Gly Val Gin Cys Phe Ser Arg 
^° " 80 

Tyr Pro Asp His Met Lys Arg His Asp Phe Phe Lys Ser Ala Met Pro 
85 90 95 

Glu Gly Tyr Val Gin Glu Arg Thr He Phe Phe » 

100 ^ ™^ Lys Asp Asp Gly Asn 



110 



Tyr Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn 

125 

Arg lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu 

Qly His Lye Leu Glu Tyr Asn Tyr Asn Ser h<« xr i 
145 i^ft ^ ^ Val Tyr He Met 

155 160 

Ala ASP Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys Thr Arg His 
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165 



170 



175 



Asn lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn 

180 



190 

Thr Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His lyr Leu 
"5 200 205 

ser Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His 

215 220 

Met val Leu Leu Glu Phe Val Thr Ala Ala Gly He His Gly Met 

235 240 
Asp Glu Leu Tyr Asn Thr Gly Met Ser Thr Gly Val Pro Ser Gly Ser 



250 



ser Ala Ala Thr Gly Ser Asn Arg Arg Leu Gin Gin Thr Gin 
260 265 270 



255 



Asn Gin 



val Asp Glu val Val Asp He Met Arg Val Asn Val Asp Lys Val Leu 

280 285 

Glu Arg Asp Gin Lys Leu Ser Glu Leu Asp Asp Arg Ala Asp Ala Leu 

295 

Gin Ala Gly Ala Ser Gin Phe Glu Thr Ser Ala Ala Lys Leu Lys Arg 



315 



320 



I-ys Tyr Txp Trp Lys Asn Cys Lys Met Trp Ala He Gly He Ser Val 

325 



330 



335 



Leu val He lie Val He He He He Val Trp Cys Val Ser 

"5 

<210> 31 
<211> 3171 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) .. (3168) 

<220> 

<223> Description of Artificial Sequence- 
NLS-BYFP-MAPKDM-KBFP construct 

<400> 31 

^ s s IS s 2? z S2 s; S2 ?S " 



10 15 



r ^ 22 s s q m ?s z si ^ - 



25 



30 



S v^f m S Z IS Z LI ^ 2 



40 45 
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- m S SI S S m S IS ?3 S I- - 

acc etc gtg acc acc ttc qqc tac aan o.-^ 

1^ v., ^ o'?^ s 31 ^ 5? 

80 

SI s? SI IS III - - s K III s: 

95 

l?y ^ Sx^ III S ?^ III r ^^'^ 

100 f^^ A«P Asp Gly Asn Tyr 

110 

125 

"I o';^ s z ni SI ^ IS ,T r ^ 

130 ^-^^ Asp Gly Asn He Leu Gly 

^•^^ 140 

SI III IS III - III SI SI III ?ll SI - SI 

ICC 

160 

S "I SI l-f ^ l!» s: - 52 ^ ^ j« ^ 

170 

ul SI III SI SI - - - IS in SI SI IS IS SI s 

190 

^ «l SI SI SI in 53 S in SI SI m SI Ill SI 

205 

cL^ ii" ti: 1^1 ti: r r '^^'^ 

210 Lys Arg Asp His Met 

^" 220 

21 III IS III S! ?l| SI SI S? S° SI 

225 230 ^ Asp 

240 

SI III SI III 5 SI SI SI SI SI SI SI 21 SI 21 

250 255 
gtg gat gcg ttg aca gaa eca cct cca aaa =.^.- 

..1 j,u p„ ,^ ^ IS SI IS SI IS SI III 

270 

s III IS II? SI s 21 IS SI SI in III SI SI SI ^ 

285 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 
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I" s IS t^. s z s: s r r 

290 OQc Asp Gly Asp 

300 *^ 

320 

SI u: s s s i.i s ?s 2: 2; g i:: - 
oT. i:^ ,M - z -I - 1« o., I 

•^^^ 350 
365 

tgg cca gaa gat gca age ttt tot ttc ca« or,*- 

...u ^ s„ ^ ^, - =s S S.' - 

380 

L^p 111 III - 

385 °* ^^"^ His A>^g Asp Asp Gly Leu Ala 

IS S S S S £ S s 

410 43^5 

s 3':° s s r.t SI =^ 

435 If; ' Val Gly Ala 

445 

S ^ IS S SI - - - - - 5s J. 

460 

fjj s s s I- s s 21 - - - - |« 

tea gtg aaa gag cag ctg cca act aaa ar.= ^».~ 

^ ... .y. .„ ^ s LI s s i;i 

4»0 

SI S S L" S SI S 2= ?S r =~ 

510 

- IS s 1:1 s s 21 J- - - - - - 

525 



li52 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 
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Pro JSP val lie .eu Ala Jsn Val Thr Gin Pro Ser Thr Glu Ser Asp 

^•'^ 540 

III s s s m ^? I" 2: ?s s S2 s 
s ?s - s t^; - - ^ ^ 

='0 575 
aag gat gta goa cca cct atg gaa gaa aaa a«■^ r^r. r. 

-y. ;>.p V,, p„ ^? I- I- «c ^ 

590 

S ?2 £ ^= - s JS ?S S - ;f: - ^ 

605 

ttg gca cca cct gag gat qtq tta c^^ nr,^ ^ 

Pro p,. Lp u 2: s: ?s Hi s 

620 

i s: z ;2 s| s 2: 2; - s - - - - s: 
z s i:^ s: i- - - - - ^ 

"0 655 

S3 ?s s s s s;: ;s - - 

670 

K S ^ 2^ ^ LI f ? r ^« 

nr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala 
"° 685 

Olu' S L'u' ?S K ^e1 p"^ ^'^^ 

690 "^^ ^ Ser Leu Glu Thr Glu Met 

"5 700 

i 2: If s ^ L=: ^ ij; -I 

'^^^ 735 

Sp ?3 ?s s It: s ^, ?.i s r - 

740 ^» Asn Val 

- 2: IS 1;: SI ^ » - - ^ 

765 

.'2 S S IJI S 21 12 5! L-: 1:2 S 21 IS 



2064 



2112 



2160 



2256 



2304 
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2496 

Pro lie 

830 

2544 
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770 

^'^ 780 

^ Zl S S L-^ ^1 - s 2? IS - - 

''^S 800 

L~ s s z ?ij s s ™ - 

815 

;S S! ^ IS IS J£= - - 5s S 

830 

S ?S IS S L-^ 15 - - - 1- 

845 

i!;sii?jiSo^L'js:?s?^s;-2!?ss;-«= - 

860 

?S 5? SI ^ S? S ?S ^ tl V-! ?S 

890 

?S S S g S! - - - - ^ 

895 

«i s.= IS ^s: jj: s is z ti 
IS IS ?s 5s s ^ tit s T. 

920 ^25 

IS I? ?3 ^= IS 1?^ g - - - - 

i!j s SI IS IS LI 15 si is 

MS MO 

L'n js: s IS ;s ^ s? IS r r "= 

QgE Ala Asp Lyo Gin Lys 

970 

SI IJI ?I? SI III ^ ;s - SI SI 51 IS IS 

985 

21 ?I! IS Si SI IS SI^JS - IS IS ?S IS SI l!I IS 



895 

2736 
2784 
2832 



2928 



2976 



I?? JS V-? SI SI IS JI SI SI ^ SI SI SI IS SI SI 

^^^^ 1020 



3024 



3072 
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s s ^ z s S S K s 2 

"° "35 1040 

?3 ?s s ^ s s: - is - - - 

1050 ^ 



1055 

tag 



<210> 32 
<211> 1056 
<212> PRT 

<2X3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence- 
NLS-BYFP-MAPKDM-EBPP construct 

<400> 32 

Met Arg Pro Arg Arg Lys Val Ser Lys Gly Glu Glu Leu Phe Thr 
5 10 15 



Gly 

val val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

25 30 

Phe ser Val Ser Gly Glu Gly Glu Gly Asp Ala Wxr Tyr Gly Lys Leu 

40 45 

Thr Leu Lys Phe He Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 

■ar Leu val Thr Thr Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr 

Pro ASP His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

" 90 95 

Oly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 

110 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

125 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 

"5 140 

His Lye Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr He Met Ala 
"° 155 

Aap Lys Gin Lys Asn Gly He Lys Val Asn Phe Lys He Arg His Asn 
"5 170 175 

He Glu ASP Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 



185 



190 



3171 



Pro lie Oly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 



205 
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^ 01„ ser Ala X.eu Ser ;.p p.. 



2" 220 



vaa .eu oa. p.e vaj ^ 

240 

Glu Leu Tyr Lys Lys Gly Asp Glu Val Asp Gly Ala Asn c 

^^^^ 255 
-1 «P «. Th, p„ „^ ^„ 

270 

Arg Asp Phe Met Ala Ala Leu Glu Ala Glu as . 

275 Tyr Asp Asp He Val 

285 

Gly Glu Thr Val Glu Lys Thr Glu Ph*» ti« n 

290 ^ 2^ Pro Leu Leu Asp Gly Asp 

300 

01, sej 0I„ se, P„ ^„ ^ ^ 

.e, 01„ CI, p„ p„ ^ ^ ^„ „. „^ 

335 

ci, «. „, ^„ ^ ^ 

"•^^ 350 
Clu 0I„ V.1 ^ p„ ^ ^„ 

365 

Trp Pro Olu ASP Ala ser Phe Cys P.e Oln Pro Gin Gin Val .eu Asp 

380 

^ ->.P 01. CI„ «^ p,» „. ^ ^ ^ ^ 

"5 400 

•n. ..u ,1 3„ ^ ^ 

Clu Arg ASP Asn Pro Ser Clu Asp Ser ryr oXy Met X.e« Pro Cys Asp 

ser P^ ji^. ser Thr Ala Val Val Ser Oln Glu Ser ^ Gly Ala 

445 

Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser Pro Glu Val Thr He 

460 

Olu Thr Leu Gin Pro Ala ^ gIu X.u Ser X.ys Ala Ala Glu Val Glu 

480 

^» vaa ox. ^ „. ^„ ^ 

490 

01. x>„ J,p ^ s.. ^ ^ ^ ^ 

510 

-P ^ 01„ «. .1, ^ „^ ^ 

525 

Pro ASP val xie x.eu Ala Asn Val Thr Gin Pro Ser Glu Ser Asp 
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^•^5 540 



Met Phe Leu Ala Gin ^ „et Glu Leu Leu Thr oly xhr Glu Ala Ala 

555 

HiB Ala Asa Asn lie lie Leu Pro Thr Glu Pro Asp Glu Ser Ser Thr 

570 

Lys ASP Val Ala Pro Pro Met Glu Glu Glu He Val Pro Gly Asn Asp 

590 

Thr Thr ser Pro Lys Glu Ttr Glu Thr Thr Leu Pro He Lys Met Asp 

600 gj,5 

Leu Ma Pro Pro Glu Asp Val Leu Leu Lys Glu Thr Glu Leu Ala 

620 

Pro Ala Lys Gly Met Val Ser Leu Ser Glu He Glu Glu Ala Leu Ala 

"° "5 
Lys Asn Asp val A^ ser Ala Glu He Pro Val Ala Gin Glu -n^: Val 

655 

val ser Glu Thr Glu Val Val Leu Ala Thr Glu Val Val Leu Pro Ser 

670 

ASP Pro He Thr Thr Leu Thr Lys Asp Val Thr Leu Pro Leu Glu Ala 

«00 685 
Glu Arg Pro Leu Val rt^ ^sp Met Thr Pro Ser Leu Glu Thr Glu Met 



700 



Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu Thr 



705 " "° ^-^^ ^ Asn Leu Gly Met 

720 



715 



Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu Val Thr Leu Gly Lys 

730 735 

ASP val val He Leu Pro Glu Thr Lys Vai Ala Glu Phe Asn Asn Val 

'^^^ 750 
Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val Lys Asp Met Ser Pro 



760 



765 



ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn Ala Asp Leu His Ser 



Oly Thr Glu Leu Xle Val Asp Asn Ser Met Ala Pro Ala Ser Asp Leu 

'=^5 800 
Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val Pro He Lys Asp Lya 

810 815 
Gly Met val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 

®25 830 
Leu val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 

845 

Gly Glu Gly Glu Gly Asp Ma Thr Tyr Gly Lys Leu Thr Leu Lys Phe 

860 



66 
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lie Cys Thr Thr Gly Lys Pro Val Pro Trp Pro Thr Leu Val Thr 
"° 875 880 



Thr Leu Thr His Gly val Gin Cys Phe Ser Arg Tyr Pro Asp His 



885 B90 



Met 
895 



Lye Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin 
Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 



925 



Glu val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys 

935 

Gly lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu 

955 960 

Tyr Asn Phe Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys 
965 „5 

Asn Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly 



985 



990 



Ser val Gin Leu Ala Asp His^^ Gin Gin Asn -mr^Pro He Gly Asp 

Gly^Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala 

^^^^ 1020 

Leu^Ser Lys Asp Pro^Asn Glu Lys Arg Asp^His Met Val Leu Leu Glu 

Phe val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
1045 1050 1055 

<210> 33 
<211> 1623 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1).,(1623) 

<220> 

<223> Description of Artificial Sequence- 

yPP-NLS-CP3 -multiple DEVD-CFP-Annexin 11 construct 
<400> 33 

s? ill z IS z s? s ill ffi n'l s " 



10 



15 



;s IS t2 i!v ^; lit ^ z ^ ^£ I- - s ^; « 

20 25 30 

gag ggc gag ggc gat gcc acc tac ggc aag ctg acc ctg aag ttc ate 144 



67 
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Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tqq ccc acc ctr o^rr ..^^ 
Cys Ojr Thr Gly l,ys Leu Pro U Pro ^ til ?S Leu ^ ^ 



55 



60 



ttc ggc tac ggc ctg cag tgc ttc gcc cgc tac ccc aac car at-rr ^.r. 
Phe Gly Tyr Gly Leu Gin Cys Phe Ila 4 Tyr 



192 



240 



^ s: ^ Es ^' Es ^ I" s cs ?s - 



85 90 95 



S Til PH ^K*" f 3^*= ^^'^ ^^'^ ^^'^ '^g'^ 3CC gag 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr A^g Ila Glu 

100 105 



336 



gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aaa aoc 
Val Lys Phe Glu Gly Asp OTar Leu Val Asa A^ He ll^ Le! lyt g?^ 

120 

ate gac ttc aag gag gac ggc aac ate ctg ggq cac aaa eta mo 
lie ASP Phe Lys Glu Asp Gly Asn He Leu 1^ lyl ^eu Ilu ^ 

aac tac aac age cac aac gtc tat ate atg qcc aac aaa n^r^ 
Asn xyr Asn Ser His Asn Val ryr Xle Me^ L'p L^f 

"° 155 160 

mv Tif f^^ ,f ? ^""^ ^'^'^ '=9" ate gag gae ggc age 528 

Gly He Lys Val Asn Phe Lys He Arg His Asa He 111 Lp cfy Ser 

X65 - 



170 175 



210 215 220 

val lit T T ^'^a 9^*= '^tg tac aag tec 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Ly! Ser 



Z^S^e^o^ IS S LI S m IS IS 

245 250 255 

ggt gac gaa gtt gat gca ggt gac gaa gtt gat gea got aae aaa ott 
Gly ASP Glu val Asp Ala Gly Asp Glu Val Lp 111 iTy ^p gi^ f^l 
260 265 



z £ s? Vr ni ^ is s s: m 



384 



432 



480 



v3 Gin f?"" f^"" ""^^ ^^"^ ^^^5 ^t^^ 99C gac ggc 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He oiy Lp G?y 

185 

Pro r? f^^ '''''' ^^"^ ''^^ ^9<= cag tec gcc eta 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser ^a 2u 

200 205 

lit ttt ^tg gtc ctg ctg gag ttc 672 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu pJe 



576 



624 



720 



768 



816 



664 



68 
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275 280 



?2 g S .-J I S - - til Z 

300 ^ 



285 

912 



1104 



1152 



Hi IS S!J S2 IS - - - J. 

320 

^ ."f IK I ?s s ??j ^! s s - - 

335 

?s ;s ?s s LI ?s ?3 r - 

340 ^ ^ ^= Ser Arg Tyr 

350 

L- s: ^? 2- s ^ SL= IS - £1 ;2 - - 

365 

9gc tac gtc cag gag cgc acc ate ttc ttc ^^r, 

Sly Tyr Val Gin Glu ;^g Thr lie Phe III t ^ T ^^'^ '^^'^ 

370 ^ ,rf -'■^^ ^''^ Lys Asp Aep Gly Asn Tyr 

380 

i S IS S2 S! ^= iJj - - s S ?2 S -° 

400 

S: S 1^ ;s 2= - 22 - - g „^ .... 

"f S2 r - s: s SI tt 5s S si 

430 

Sp s Ill s SI IS III III III I- SI s 

445 

S 21 SI SI ?2 5 2 21 - III SI SS 12 ill S 

'"^^ 460 

SI SI 25 in l!| SI ?l! S IS in HI SI III ||| 

^ 475 

480 

- SI III SI S - III SI 22 SI S L'l SI IS - 

495 

?S SI SI 22 IS ;i! ?S SI SI SI ?S SI S? IS 

510 

22 22 ?S IS SI SI SI SI 22 SI S^ 

515 Leu Ser 



1344 



1392 
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<210> 34 
<211> 541 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence* 

VFP-NLS-CP3-multiple DBVD-CFP-Annexin II construct 
<400> 34 

Met val ser Lys Gly 0I« Glu I.u Phe Thr Gly Val Val Pro He Leu 

10 15 



val Glu Leu Gly Asp Val Gly His Lys Phe Ser Val Ser Gly 

30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Xle 

45 

Cya Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr ^ 

60 

Phe Gly Tyr Gly Leu Gin Cys Phe Ala Arg Tyr Pro Asp His Met Lys 

'^^ 80 
Gin His ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Olu 

-^°5 110 
val Lys Phe Glu Gly Asp Thr Leu Val As^ Arg lie Glu Leu Lys Gly 

125 

lie ^p Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 

140 

Asn Tyr Asn Ser His Asn Val Tvr He M^t- at. , 

145 Ala Asp Lys Gin Lys Asn 

Gly He Lys Val Asn Phe Lys He Ara Hi« n 

165 ^-^-^ Ser 

170 3^75 

val Gin Leu Ala Asp His xyr Gin Gin Asn Thr Pro He Gly Asp Gly 

190 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 

205 

ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

220 

val Thr Ala Ala Gly lie 0^ Leu Gly Met Asp Glu Leu ^ Lys Ser 



235 



240 



70 
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Oly Arg Arg Lys Arg Gin Lys Arg Ser Ala Gly Asp Glu Val Aap Ala 
245 250 255 

Gly ASP Glu val ASP Ala Gly Asp Glu Val Asp Ala Gly Asp Glu Val 

265 270 

Asp Ala Gly ser Thr Met Val Ser Lys Gly Glu Glu Leu Phe IHr Gly 
275 280 285 

val val Pro He Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 

Phe ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
"° 315 

Ihr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 

330 

Thr Leu Val Thr Thr Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr 
340 345 350 ^ ^ 

Pro ASP His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 

-^^^ 360 



Gly TVr Val Gin Glu Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr 

■^'S 380 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 

395 400 

He Glu Leu Lys Gly He Asp Phe Lys Glu Asp Gly Asn He Leu Gly 
405 410 415 

His Lys Leu Glu Tyr Asn Tyr He Ser His Asn Val Tyr He Thr Ala 

425 430 

Asp Lys Gin Lys Asn Gly He Lys Ala Asn Phe Lys He Arg His 



440 



Asn 



445 



He Glu Asp Gly ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 

455 4go 

Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 



475 



480 



Thr Gin ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Aap His Met 
485 490 495 

val Leu Leu Glu Phe Val Thr Ala Ala Gly He rtr Leu Gly Met Asp 
Glu Leu T3^ Lys Met Ser Thr Val His Glu He Leu Cys Lys Leu Ser 



520 525 



Leu Glu Gly Val His Ser Thr Pro Pro Ser Ala Gly Ser 
530 535 



540 



<210> 35 
<211> 24 
<212> DNA 



71 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG epitope 
<400> 35 

gactacaaag acgacgacga caaa 

<210> 36 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: FLAG epitope 
<400> 36 

Asp Tyr Lys Asp Asp Asp Asp Lys 
1 5 



24 



<210> 37 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 37 

tacccatacg acgtaccaga ctacgca 

<210> 38 
<211> 9 
<212> PRT 

<2a3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HA epitope 
<400> 36 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 



<210> 39 
<211> 18 
<212> DMA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 39 

ccaccagaac cagaaaca 



13 



<210> 40 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: KT3 epitope 
<400> 40 

Pro Pro Glu Pro Glu Thr 
1 5 

<210> 41 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 41 

gcagaagaac aaaaattaat aagcgaagaa gactta 



36 



<210> 42 
<211> 12 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Myc epitope 
<400> 42 

Ala Glu Glu Gin Lys Leu He Ser Glu Glu Asp Leu 



<210> 43 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1),.(717) 



10 



<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 43 



46 

Leu 

15 



^ S! SI ^ S2 - - 15 j2 ^ «c 

s ^ z s -^i m I- IS lit -I 
?s z; t^'^-i^, - - - - - - 



96 



144 
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m ^ z ^. ts^^^^^sii-^ - ^ ?s - 

55 60 

z z S2 s ^ til zi s: - 

75 go 

o^' L- ^ J- - s oL' is 

«5 90 95 

s L- m zi ^ ^. s - - 

110 

?s 5^ IS if^ - ?s a ti ^ s SI si ;s is 

nl Z SI L'^ l!J L'l ??: 21 15 S SI 21 SI 

ti: III 111 '.II 5^ iii r 

145 -.CA Lys Aen 

155 160 

ggc ate aag gtg aac ttc aag ate cge cac aae ahr^ ^ 
cay lie .ys val Phe .y^ zie lit til Tit tl^ I?y ITr 



480 
160 

528 



165 i-jfs 

170 175 



Va? Gin ?r f ""^^ ^'^^ ^^<= ate gge gae gge 

val Gin Leu Ala Asp Has Tyr Gin Gin Aen Thr Pro He G?y I^p Ify 

190 



576 



S 21 £1 S S.= 21 sl= ^ SI S SI 21 

"5 200 205 

s s: ^1 SI s s: f.t 21 21 i-i J2 

2-^5 220 

^ S= 15 S 2: 1:= |;| 21 ^ 

235 

<210> 44 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: EYFP 
<400> 44 

«et val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
^ 10 15 



74 
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Val Glu Leu ASP Gly Asp Val Asn Gly His Lye Phe Ser Val Ser Gly 
Qlu Oly Glu Gly ABp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
Cys Thr Thr Gly LyB Leu Pro Val Pro Trp Pro Thr Leu Val Thr Ihr 



55 60 



Gly Tyr Gly Leu Gin Cys Phe Ala Arg oyr Pro Asp His Met Lys 



75 



80 



Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 



90 



95 



Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
val Lys Phe Qlu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 



"5 



Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 



155 



160 



Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
1«5 170 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 IBS 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Tyr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

220 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 



230 235 

<210> 45 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) , , (717) 

<220> 

<223> Description of Artificial Sequence: EGFP 
<400> 45 

M^? o^"" ^^"^ ttc ace ggg gtg gtg ccc ate eta 4ft 

Met val ser Lys Gly Glu Glu Leu Phe Thr If? Va! U Pro Se III " 

^ 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 



75 



wo 00/50872 

PCT/USOO/04794 



Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val ser Gly 

?K i!^ IS m z m ^ ^ J- 2? s s i° - 



40 45 



S ?S ?S 5S S S ^ IS s J- 
IS 'C I- SI - - - - - 

90 95 

S Til pS p^'' r ^""^ ^3*= ^^'^ ^'^^ *=3= gag 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr A^g Ila III 
100 105 no 

III ^ IS m I'p ?S ^2 v^! LI S ul IS LI 1?^ 

120 

ill S IS L-p s: - IS s: s s? ^ - 

140 



192 



240 



288 



336 



384 



'^l ^ -I S ^ S - - - "I 



150 155 



480 
160 



f^r ill ^ S? ^ "! S SS S: IS L-^ l!J S 

"0 175 



Vaf Gin Z» ?r ''^'^ ''^'^ "3 ^=^9 93= gac ggc 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He lly Lp G?y 

185 

?2 1 S S It^ SI S: 23 S IS L- SI III 

200 205 

SI^L-ISISII?I.^I5-Si^l?I!iS-|SJII 

2-^5 220 

S SI SI SI I?? SI S SI If = SI? SI IS SI ^ 51 



528 



576 



624 



672 



717 



<210> 46 
<211> 239 
<212> PRT 

<213> Artificial Sequence 



235 



76 
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<220> 

<223> Description of Artificial Sequence: EGPP 
<400> 46 

Met Val ser Lys Gly Glu Qlu Leu Phe Thr Gly Val Val Pro He teu 
5 10 15 



val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Qly 
^° 25 30 ' 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 



40 



45 



Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 



60 



I^u Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

'° •'S 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 35 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
° 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

135 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

ISS 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
"5 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin As, Thr Pro He Gly Asp Gly 



185 



190 



Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 



200 



205 



ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 

230 235 

<210> 47 
<211> 717 
<212> DNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1) . . (717) 

<220> 

<223> Description of Artificial Sequence: EBFP 



77 
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<400> 47 

s v^? s z s ?s ?s ?!? ;i! ?s SI s " 

10 IS 

IS 2? 1?^ ;s - IS 
IS IS IS z s s ?s ?s V. s ?s s ^ s: 

45 

^'^/^m SI s s V-! SI ss s= ?s ?s ?s 

55 60 

s s IS ?s IS gi ^j: - - =s is si 

75 80 

IS SI SI III III III - - - 

85 90 95 

S ?ll ;JI III IS 1^ SI SI 15 SI IS I? ?S SI IS IS 
?2 "I III IS SI SI ?S SI ;2 SI 21 SI IS SI 1^1 SI 

120 

ate gac ttc aag gag gac ggc aac ate ctg ggq cac aaa e^« 
lie ASP Phe .ys Glu ASP Gly Asn IXe .e^ 

"5 

aac ttc aac age cac aac ate ^^»^ ^ 

Asn Phe Asn Ser His Asn vll ^ Til Met f^a f f"^ nT^ 
145 150 -^-^^ Gin Lys Asn 

155 160 

SI SI 7. 13 SI IS ^. SI SI SI SI SI IS SI SI SI 

170 175 

v^? IS SI £1 SI SI ^ IS SI SI IS SI SI SI SI SI 

^^5 190 

m S S SI SI SI SI SI ^ SI SI SI IS IS IS SI - 

2*^° 205 

SI :S SI ^ SI IS S SI SI SI III S' T 

210 ni e ^ Glu Phe 

215 220 

II! ?S LS SI SI SI IS SI SI SI SI IS SI IS ^ 



96 



144 



192 



240 



2B8 



336 



384 



432 



480 



528 



576 



78 
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<210> 48 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: BBPP 
<400> 48 

Met Val Ser Lya Gly Qlu Glu Leu Phe Thr Gly Val Val Pro He Leu 

^ 10 15 

val Qlu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 



25 



30 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
35 40 45 



lie 



Cys 1^ Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 60 



Leu Thr His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

'° '5 BO 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
1" 120 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
"° 135 140 

Asn Phe Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 1,5 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 



185 



190 



Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 



205 



ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 



230 235 



<210> 49 
<211> 717 
<212> DNA 

<213> Artificial Sequence 



79 
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<220> 

<221> CDS 

<222> (1)..(717) 

<220> 

<223> Description of Artificial Sequence: ECFP 
<400> 49 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate eta 4ft 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr cly Va! vS Pro lie Leu 

^^"^ ^^"^ ttc age gtg tec ggc 96 

val Glu Leu Asp Gly Asp Val Aan Gly His Lys Phe Ser Va! Ser Ity 
20 25 30 

gag ggc gag gge gat gee acc tac ggc aag ctg acc ctg aag ttc ate 14a 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys pSe 111 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Tr? Pro Thr Leu Va? ^ 
^° 55 60 

tI» ^ ^? ''^^ ^3*= '^^^ ««« 9ac cac atg aag 

Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lyf 

^= '0 75 80 

cag cac gac ttc ttc aag tec gcc atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Ilu 
85 90 95 

^^'^ 3ac gac ggc aac tac aag acc cgc gcc gag 336 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr LyI Thr A^g Ila l?u 

105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He ^lu Leu Lys 111 

120 125 

rlt o^*" f^^ ^^'^ ^^'^ ^^'^ ^^'^ '='=9 '^^'^ '^tg gag tac 432 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 

aac tac ate age cac aac gtc tat ate acc gcc gac aag cag aaa aac 4B0 
Asn Tyr He Ser His Asn Val Tyr He Thr Ila Lp Lyf Gin Lys ^n 
150 155 160 

ggc ate aag gcc aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Ala Asn Phe Lys He Arg His Asn He Ilu Lp tly ser 
165 170 175 

v^? f"*' '=*9 cag aac acc ccc ate ggc gac ggc 576 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He oiy Lp G?y 
180 185 190 

v!? f""^ r^^ '='=9 age acc cag tec gcc ctg 624 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser La Leu 
135 200 205 



240 



288 



80 
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z ^ IS s? t?. ^ L" s - 2? IS ^s: - 



215 220 



g ?S S lit ?2 - «? - - - - 

230 235 

<210> 50 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> DeBcription of Artificial Sequence: ECFP 
<400> 50 

Met Val ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
5 10 15 



val Glu Leu i^p Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

60 

Leu Thr Trp Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 

^5 80 
Gin His ASP Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

110 

val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

120 

lie Asp Phe Lys Glu Asp Gly. Asn lie Leu Gly His Lys Leu Glu Tyr 



"5 



Aan Tyr lie Ser His Asn Val Tyr He Thr Ala Asp Lys Gin Lys Asn 



155 



160 



Gly lie Lys Ala Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
^" "0 175 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

185 

Pro val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

val Thr Ala Ala Gly He tbr Leu Gly „et Asp Glu Leu Tyr Lys 



717 



81 
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230 

-530 235 

<210> 51 
<211> 720 
<212> UNA 

<213> Artificial Sequence 

<220> 

<221> CDS 

<222> (1)..(717) 

<220> 

<223> Description of Artificial Sequence: Preci25 
<400> 51 

m SI I- ^ 2: js: - z s s s " 

10 15 

gtt gaa tta gat ggt gat gtt aac qqc cac aarr t-t-r. t-^^ ^ 

Val Clu oly ?aX A3„ ^ Ss^ L^s^ pL^ Sx Se^ 

S2 15 IS HI 111 s 2: ?s s k 

45 

?s ?s S l| ?s - ?S 2* ?2 s 

55 60 

s ^= ^ 2: 2: s s ;li ^- 

^5 80 

S T. ^ 52 JS Iff S 12 Z 

90 

5^1 1?. 2: 1" 15 IS ?2 25 ?S 2: ;S ?2 IS 2^ J^' - 

125 

?2 ^= 52 12 S 2: ?2 2! S2 2t 2; 22 

^•^^ 140 

2S ^ 2: 2? 2: 25 ;2 2? 12 L2 r r 

145 ^ -^-^^ Asp Lys Gin Lys Asn 

S 2: 2: 2? 25 52 2? ?2 S S: 25 ?2 22 25 2? 25 



144 



192 



240 



288 



336 



384 



432 



480 



175 



9tt caa eta gca gac cat tat caa caa aat act cca att ggc gat 



ggc 576 
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Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 



185 



190 



cct gtc ctt tta cca gac aac cat i-stn 

^ v.. ^.^^.i^ s: ^ S2 '^i s LI tr. is. 



205 



=2 L" ti: l^. "? ;•■= "3 stc «t ctt ttt 



.y. oxu ^ 1^ - - ij! jj: 

220 

I' ?s s s s s i!= «? - - - - - 

235 

<210> 52 
<211> 239 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Fred25 
<400> 52 

«et Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
^ . " 15 



val Glu Leu ABp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

Glu Gly Glu Gly Asp Ala thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
" 40 45 

Cys r^r Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

55 60 

Leu cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 



'5 80 



Arg His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
85 90 



Gin Glu 
95 



Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 



"5 



val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 



125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

-*-^5 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 

"5 150 

Gly He Lys val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser 
165 170 „5 

val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 



190 



83 



wo 00/50872 

PCTAJSOO/04794 

Pro Val Leu I^u Pro Asp Aen His Tyr Leu Ser Thr Gin Ser Ala Leu 

200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

215 220 

val Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
225 230 235 



<210> 53 
<211> 14 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Caspase.1,4 5 
substrate recognition sequence ' ' 



<400> 53 
tgggaacatg acaa 



14 



<210> 54 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1 4 5 
substrate recognition sequence ' ' 

<400> 54 
Trp Glu His Asp 
1 



<210> 55 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: proCaapase-1 
substrate recognition sequence 



<400> 55 
tggtttaaag ac 



<210> 56 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



12 



<220> 



<223> Description of Artificial Sequence: proCaspase-l 
substrate recognition sequence 



<400> SB 

Trp Phe Lys Asp 



84 
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1 



<210> 57 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CaBpase-2 
substrate recognition sequence 

<400> 57 
gacgaacacg ac 

<210> 5fl 
<21l> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of . Artificial Sequence: Caspase-2 
substrate recognition sequence 

<400> 58 
Asp Glu His Asp 
1 



<210> 59 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 
gacgaagttg ac 

<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 
<211> 12 
<212> DHA 

<213> Artificial Sequence 
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<220> 



<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence P^°^aspase 3 

<400> 61 
atagaaacag ac 

12 

<210> 62 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

°^ Artificial Sequence: proCaspase-3 
substrate recognition sequence °^^spase 3 

<400> 62 

He Glu Thr Asp 



<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase 4 s 
substrate recognition sequence ProCaspase-4 , 5 

<400> 63 
tgggtaagag ac 

12 

<210> 64 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

SS"'*'' °^ Artificial sequence: proCaspase-4 , 5 
substrate recognition sequence 

<400> 64 

Trp Val Arg Asp 



<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Caspase 6 
substrate recognition sequence ^^^P^^e-e 

<400> 65 
gtagaaatag ac 

12 



86 
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<210> 66 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence ^^^^^ « 



<400> 66 

Val Glu He Asp 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

s^sSf^f°'' °' ^ificial sequence: Caspase-6 
substrate recognition sequence 

<400> 67 
gtagaacacg ac 

12 

<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

^^""^^ °^ Artificial Sequence: Caspase-6 

substrate recognition sequence 

<400> 68 

Val Glu His Asp 



<210> 69 
<211> 12 
<212> DMA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence P^o^^spase 6 



<400> 69 
acagaagtag ac 

<210> 70 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



12 
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<220> 

<223> Description of Artificial Sequence: proCaspase-6 
substrate recognition sequence 

<400> 70 

Thr Glu Val Asp 



<210> 71 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 



<400> 71 
atacaagcag ac 



<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



12 



<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 



<400> 72 
lie Gin Ala Asp 
1 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
substrate recognition sequence 

<400> 73 
gtagaaacag ac 

12 

<2X0> 74 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-e 
substrate recognition sequence 

<400> 74 
Val Glu Thr Asp 
1 



B8 
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<210> 75 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: proCaspaBe-S 
substrate recognition sequence 



<400> 75 
ttagaaacag ac 



<210> 76 
<211> 4 
<212> PUT 

<213> Artificial Sequence 
<220> 



12 



<223> Description of Artificial Sequence: proCaspase-8 
euDstrate recognition sequence 



<400> 76 
Leu Glu Thr Asp 
1 



<210> 77 
<2H> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspa8e-9 
substrate recognition sequence 

<400> 77 
ttagaacacg ac 

12 

<210> 78 
<2H> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ca8pase-9 
substrate recognition sequence 

<400> 78 
lieu Glu His Asp 
1 



<210> 79 
<211> 12 
<212> DNA 

<213> Artificial Sequence 



89 



wo 00/50872 



<220> 

<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 

<400> 79 
ttagaacacg ac 

<210> 80 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 

<400> 80 
Leu Glu His Asp 
1 



<210> 81 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 81 
agccaaaatt ac 



<210> 82 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 82 
Ser Gin Asn Tyx 
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 83 
ccaatagtac aa 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CasDase-2 
substrate recognition sequence ^ ^ 

<400> 57 
gacgaacacg ac 

12 

<210> 58 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Ca8DaBe-2 
substrate recognition sequence Caspafie-2 

<400> 58 

Asp Glu His Asp 



<210> 59 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

°* Artificial Sequence: Caspase-3,7 
substrate recognition sequence 

<400> 59 
gacgaagttg ac 

12 

<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-3 7 
substrate recognition sequence ^^^P^^e 3,7 

<400> 60 
Asp Glu Val Asp 
1 



<210> 61 
<211> 12 
<212> DNA 
<213> Artificial 

<220> 



Sequence 
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<223> Description of Artificial Sequence: proCaspase-3 
substrate recognition sequence 



<400> 61 
atagaaacag ac 



<210> 62 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



12 



<220> 

<223> Description of Artificial Sequence: proCaspase-S 
substrate recognition sequence 



<400> 62 
He Glu Thr Asp 
1 



<210> 63 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: proCaepase-4 5 
substrate recognition sequence 



<400> 63 
tgggtaagag ac 



<210> 64 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 



12 



<223> Description of Artificial Sequence; proCaspase-4, 5 
substrate recognition sequence 



<400> 64 

Trp Val Arg Asp 



<210> 65 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CaspaBe-6 
substrate recognition sequence 
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<400> 65 
gtagaaatag ac 

12 

<210> 66 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 66 
Val Glu He Asp 
1 



<210> 67 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 67 
gtagaacacg ac 



<210> 68 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-6 
substrate recognition sequence 

<400> 68 
Val Glu His Asp 
1 



<210> 69 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaBpase-6 
substrate recognition sequence 

<400> 69 
acagaagtag ac 
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<210> 70 
<211> 4 
<2a2> PRT 

<213> Artificial Sequence 
<220> 



fi^^S^^'''' Artificial sequence: proCaspase-e 
substrate recognition sequence 



<400> 70 
Thr Glu Val Asp 
1 



<210> 71 
<2H> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 



<400> 71 
atacaagcag ac 

<210> 72 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



12 



<220> 

<223> Description of Artificial Sequence: proCaspase-7 
substrate recognition sequence 



<400> 72 

He Gin Ala Asp 



<210> 73 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Caspase-B 
substrate recognition sequence 



<400> 73 
gtagaaacag ac 



<210> 74 
<211> 4 



12 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-8 
siajstrate recognition sequence 

<400> 74 
Val Glu Thr Asp 
1 



<210> 75 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: proCaspase-8 
sut)strate recognition sequence 



<400> 75 
ttagaaacag ac 



12 



<210> 76 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: proCaspase- 
substrate recognition sequence 

<400> 76 
Leu Glu Thr Asp 
1 



<210> 77 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-9 
substrate recognition sequence 

<400> 77 
ttagaacacg ac 

12 

<210> 78 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Caspa8e-9 
substrate recognition sequence 



<400> 76 

Leu Glu His Asp 



<210> 79 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 



<400> 79 
ttagaacacg ac 



<210> BO 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 



12 



<223> Description of Artificial Sequence: proCaspase-9 
substrate recognition sequence 



<400> 80 
Leu Glu His Asp 
1 



<210> 81 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 



<400> 81 
agccaaaatt ac 



<210> 82 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



12 



<220> 



<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 
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<400> 82 
Ser Gin Asn Tyr 
1 



<210> 83 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 83 
ccaatagtac aa 

12 

<210> 84 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HIV protease 
substrate recognition sequence 

<400> 84 
Pro lie Val Gin 
1 



<210> 85 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 85 
atgtttggag ga 



<210> 86 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 86 

Met Phe Gly Gly 



97 



wo 00/50872 



PCT/US00AM794 



1 



<210> 87 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 87 
gcaaaaaaaa ga 

12 



<210> 88 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Adenovirus 
endopeptidase substrate recognition sequence 

<400> 88 
Ala Lys Lys Arg 
1 



<210> 89 
<2H> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 89 
gtgaaaatg 



<210> 90 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 90 
Val Lys Met 
1 
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<210> 91 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 91 
gacgcagaat tc 

12 

<210> 92 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> description of Artificial Sequence: b-Secretase 
substrate recognition sequence 

<400> 92 
Asp Ala Glu Phe 
1 



<210> 93 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 



s^«^^'*^^'''' Artificial Sequence: Cathepsin D 
substrate recognition sequence 



<400> 93 

aaaccagcat tattc 

<210> 94 
<2H> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 



15 



^''^''^ Artificial Sequence: Cathepsin D 

substrate recognition sequence 



<400> 94 

Lys Pro Ala Leu Phe 
1 5 



<210> 95 
<211> 9 
<212> DMA 
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<213> Artificial Sequence 
<220> 



Sh''^^^^"" Artificial sequence: Cathepsin D 
substrate recognition sequence f ^ ^ 



<400> 95 
ttcagatta 



<210> 96 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 



'''''' s^SKi'''' °' ^tificial sequence: Cathepsin D 
substrate recognition sequence 



<400> 96 
Phe Arg Leu 
1 



<210> 97 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Matrix 

Metalloprotease substrate recognition sequence 



<400> 97 
ggaccattag gacca 

<210> 98 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 



15 



<223> Description of Artificial Sequence: Matrix 

Metalloprotease substrate recognition sequence 



<400> 98 
Gly Pro Leu Gly Pro 
1 5 



<210> 99 
<211> 12 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Granzyme B 
substrate recognition sequence 

<400> 99 
atagaaccag ac 

<210> 100 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Granzyme B 
substrate recognition sequence 

<400> 100 
lie Glu Pro Asp 
1 



<210> 101 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 101 

atgcccaaga agaagccgac gcccatccag ctgaac 

<210> 102 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition secfuence 

<400> 102 

Met Pro Lys Lys I*ys Pro Thr Pro lie Gin Leu Asn 
15 10 

<210> 103 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 
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<400> 103 

atgctggccc ggaggaagcc ggtgctgccg gcgctcacca tcaac 

<210> 104 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Anthrax 
protease substrate recognition sequence 

<400> 104 

Met Leu Ala Arg Arg Lys Pro Val Leu Pro Ala Leu Thr He Asn 
^5 10 15 

<210> 105 
<211> 18 
<2a2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus /botulium substrate recognition sequence 

<400> 105 

gcctcgcagt ttgaaaca 

<210> 106 
<211> e 
<212> PRT 

<213> Artificial Secjuence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium substrate recognition sequence 

<400> 106 

Ala Ser Gin Phe Glu Tlir 
1 5 

<210> 107 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium substrate recognition sequence 

<400> 107 

gcttctcaat ttgaaacg 



102 



wo 00/50872 



PCTAJSOO/04794 



<210> 108 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

tetanus/botulium substrate recognition sequence 

<400> 108 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 109 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 109 

gccaaccaac gtgcaaca 

<210> 110 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin A substrate recognition sequence 

<400> 110 

Ala Asn Gin Arg Ala Thr 
1 5 



<210> 111 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin B substrate recognition sequence 

<400> 111 

gcttctcaat ttgaaacg 

<210> 112 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin B substrate recognition sequence 

<400> 112 

Ala Ser Gin Phe Glu Thr 
1 5 



<210> 113 
<2I1> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C substrate recognition sequence 

<400> 113 

acgaaaaaag ctgtgaaa 



<210> 114 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin C stibstrate recognition sequence 

<400> 114 

Thr Lys Lys Ala Val Lys 
1 5 



<210> 115 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 115 

gaccagaagc tctctgag 



<210> 116 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin D substrate recognition sequence 

<400> 116 

Asp Gin Lys Leu Ser Glu 
1 5 



<210> 117 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sequence 

<400> 117 

atcgacagga tcatggag 

<210> 118 
<2H> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin E substrate recognition sequence 

<400> 118 

He Asp Arg He Met Glu 
1 5 



<210> 119 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence 

<400> 119 

agagaccaga agctctct 

<210> 120 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin F substrate recognition sequence 
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<400> 120 

Arg Asp Gin Lys Leu Ser 
1 5 



<210> 121 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 121 

acgagcgcag ccaagttg 

<210> 122 
<211> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Botulinum 
neurotoxin G substrate recognition sequence 

<400> 122 

Thr Ser Ala Ala Lys Leu 
1 5 



<210> 123 
<211> 69 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Cytoplasra/cytoskeleton target sequence 

<400> 123 

atgtctactg tccacgaaat cctgtgcaag ctcagcttgg agggtgttca ttctacaccc 60 
ccaagtgcc gg 

<210> 124 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 

Cytoplasm/cytoskeleton target sequence 
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<400> 124 

Met Ser Thr Val His Glu lie Leu 
1 5 

His Ser Thr Pro Pro Ser Ala 
20 



Cys Lys Leu Ser Leu Glu Gly Val 
10 15 



<210> 125 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Inner surface 
of plasma membrane target sequence 

<400> 125 

atgggatgta cattaagcgc agaagacaaa gcagcagtag aaagaagcaa aatgatagac 60 
agaaacttaa gagaagacgg agaaaaagct gctaga 9g 



<210> 126 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
of plasma membrane target 

<400> 126 

Met Gly Cys Thr Leu Ser Ala Glu 
1 5 

Lys Met He Asp Arg Asn Leu Arg 
20 



Sequence: Inner surface 
sequence 



Asp Lys Ala Ala Val Glu Arg Ser 
10 15 

Glu Asp Gly Glu Lys Ala Ala Arg 
25 30 



<210> 127 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence 

<400> 127 

agaaggaaac gacaaaag 



18 



<210> 128 
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<2H> 6 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleus target 
sequence ^ 

<400> 128 

Arg Arg Lys Arg Gin Lys 
1 5 



<210> 129 
<211> 90 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 129 

agaaaacgta tacgtactta cctcaagtcc tgcaggcgga tgaaaagaag tggttttgag 60 
atgtctcgac ctattccttc ccaccttact 

90 

<210> 130 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nucleolus 
target sequence 

<400> 130 

Arg Lys Arg He Arg Thr Tyr Leu Lys Ser Cys Arg Arg Met Lys Arg 

Ser Gly Phe Glu Met Ser Arg Pro He Pro Ser His Leu Thr 
2° 25 30 

<210> 131 
<211> 87 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mitochondria 
target sequence 

<400> 131 

atgtccgtcc tgacgccgct gctgctgcgg ggcttgacag gctcggcccg gcggctccca 60 
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gtgccgcgcg ccaagatcca ttcgttg 

<210> 132 
<211> 29 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Mitochondria 
target sequence 

<400> 132 

Met Ser Val Leu Thr Pro Leu Leu Leu Arg Gly Leu Thr Gly Ser Ala 
Is 10 15 

Arg Arg Leu Pro Val Pro Arg Ala Leu lie His Ser Leu 
20 25 



87 



<210> 133 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 

<400> 133 

atgagcattg ttttaataat tgttattgtg gtgatttttt taatatgttt tttatattta 
agcaacagca aagatcccag agtaccagtt gaattaatg 



<210> 134 
<211> 33 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear 
Envelope target sequence 

<400> 134 

Met Ser He Val Leu He He Val He Val Val He Phe Leu He Cys 
1 5 10 



15 



Phe Leu Tyr Leu Ser Asn Ser Lys Asp Pro Arg Val Pro Val Glu Leu 
20 25 30 



Met 



<210> 135 
<211> 246 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 135 

atgaggcttc gggagccgct cctgagcggc agcgccgcga tgccaggcgc gtccctacag 60 
cgggcctgcc gcctgctcgt ggccgtctgc gctctgcacc ttggcgtcac cctcgtttac 120 
tacctggctg gccgcgacct gagccgcctg ccccaactgg tcggagtctc cacaccgctg 180 
cagggcggct cgaacagtgc cgccgccatc gggcagtcct ccggggagct ccggaccgga 240 
ggggcc 246 

<210> 136 
<211> 82 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Golgi target 
sequence 

<400> 136 

Met Arg Leu Arg Glu Pro Leu Leu Ser Gly Ser Ala Ala Met Pro Gly 
15 10 15 

Ala Ser Leu Gin Arg Ala Cys Arg Leu Leu Val Ala Val Cys Ala Leu 
20 25 30 

His Leu Gly Val Thr Leu Val Tyr Tyr Leu Ala Gly Arg Asp Leu Ser 
35 40 45 

Arg Leu Pro Gin Leu Val Gly Val Ser Thr Pro Leu Gin Gly Gly Ser 
50 55 60 

Asn Ser Ala Ala Ala lie Gly Gin Ser Ser Gly Glu Leu Arg Thr Gly 
65 70 75 80 

Gly Ala 



<210> 137 
<211> 150 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 
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<400> 137 

gaaacaataa gacctataag aataagaaga tgttcttatt ttacatctac agacagcaaa 60 
atggcaattc aattaagatc tccctttcca ttagcattac caggaatgtt agctttatta 120 
ggatggtggt ggtttttcag tagaaaaaaa 15q 

<210> 138 
<211> 50 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Endoplasmic 
reticulum target sequence 

<400> 138 

Glu Thr lie Arg Pro lie Arg He Arg Arg Cys Ser Tyr Phe Thr Ser 
15 10 15 

Thr Asp Ser Lys Met Ala He Gin Leu Arg Ser Pro Phe Pro Leu Ala 
20 25 30 

Leu Pro Gly Met Leu Ala Leu Leu Gly Trp Trp Trp Phe Phe Ser Aro 
35 40 45 

Lys Lys 
50 

<210> 139 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 139 

gccttgcaga agaagctgga ggagctagag cttgatgag 39 

<210> 140 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 140 

Ala Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
15 10 
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<210> 141 
<211> 1024 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 

<400> 141 

gccgacctca gtcttgtgga tgcgttgaca gaaccacctc cagaaattga gggagaaata 60 
aagcgagact tcatggctgc gctggaggca gagccctatg atgacatcgt gggagaaact 120 
gtggagaaaa ctgagtttat tcctctcctg gatggtgatg agaaaaccgg gaactcagag 180 
tccaaaaaga aaccctgctt agacactagc caggttgaag gtatcccatc ttctaaacca 240 
acactcctag ccaatggtga tcatggaatg gaggggaata acactgcagg gtctccaact 300 
gacttccttg aagagagagt ggactatccg gattatcaga gcagccagaa ctggccagaa 360 
gatgcaagct tttgtttcca gcctcagcaa gtgttagata ctgaccaggc tgagcccttt 420 
aacgagcacc gtgatgatgg tttggcagat ctgctctttg tctccagtgg acccacgaac 480 
gcttctgcat ttacagagcg agacaatcct tcagaagaca gttacggtat gcttccctgt 540 
gactcatttg cttccacggc tgttgtatct caggagtggt ctgtgggagc cccaaactct 600 
ccatgttcag agtcctgtgt ctccccagag gttactatag aaaccctaca gccagcaaca 660 
gagctctcca aggcagcaga agtggaatca gtgaaagagc agctgccagc taaagcattg 720 
gaaacgatgg cagagcagac cactgatgtg gtgcactctc catccacaga cacaacacca 780 
ggcccagaca cagaggcagc actggctaaa gacatagaag agatcaccaa gccagatgtg 840 
atattggcaa atgtcacgca gccatctact gaatcggata tgttcctggc ccaggacatg 900 
gaactactca caggaacaga ggcagcccac gctaacaata tcatattgcc tacagaacca 960 
gacgaatctt caaccaagga tgtagcacca cctatggaag aagaaattgt cccaggcaat 1020 
9ata 1024 

<210> 142 
<211> 566 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Size exclusion 
target sequence 
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<400> 142 

Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu lie 
15 10 15 

Glu Gly Glu lie Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu Pro 
20 25 30 

Tyr Asp Asp lie Val Gly Glu Thr Val Glu Lys Thr Glu Phe lie Pro 
35 40 45 

Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys Lys 
50 55 60 

Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys Pro 
^5 70 75 80 

Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr Ala 
85 90 95 

Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp Tyr 
100 105 110 

Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin Pro 
115 120 125 

Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His Arg 
130 135 140 

Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr Asn 
150 155 160 

Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr Gly 
165 170 175 

Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin Glu 
180 185 190 

Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val Ser 
195 200 205 

Pro Glu Val Thr lie Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser Lys 
210 215 220 

Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala Leu 
225 230 235 240 

Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser Thr 
245 250 255 

Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp He 
260 265 270 

Glu Glu lie Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin Pro 
275 280 285 

Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu Thr 
290 295 300 
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Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu Pro 
305 310 315 320 

Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu He 
325 330 335 

Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr Leu 
340 345 350 

Pro He Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr Lys 
355 360 365 

Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu He 
370 375 380 

Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro Val 
385 390 395 400 

Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr Glu 
405 410 415 

Val Val Leu Pro Ser Asp Pro He Thr Thr Leu Thr Lys Asp Val Thr 
420 425 430 

Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro Ser 
435 440 445 

Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr Glu 
450 455 . 460 

Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser Glu 
465 470 475 480 

Val Thr Leu Gly Lys Asp Val Val He Leu Pro Glu Thr Lys Val Ala 
485 490 . 495 

Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser Val 
500 505 510 

Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys Asn 
515 520 525 

Ala Asp Leu His Ser Gly Thr Glu Leu He Val Asp Asn Ser Met Ala 
530 535 540 

Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr Val 
545 550 555 560 

Pro He Lys Asp Lys Gly 
565 



<210> 143 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 143 

atgtgggcaa tcgggattac tgttctggtt atcttcatca tcatcatcat cgtgtgggtt 60 



<210> 144 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 144 

Met Trp Ala He Gly He Thr Val Leu Val He Phe He He He He 
^5 10 15 

He Val Trp Val Val 
20 



<210> 145 
<211> 61 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 145 

atgtgggcga tagggatcag tgtcctggtg atcattgtca tcatcatcat cgtgtggtgt 60 



<210> 146 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Vesicle 
membrane target sequence 

<400> 146 

Met Trp Ala He Gly He Ser Val Leu Val He He Val He He He 
15 10 15 

He Val Trp Cys 
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20 



<210> 147 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 147 

gacctgcaga agaagctgga ggagctggaa cttgacgag 

<210> 148 
<211> 13 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Nuclear Export 
target sequence 

<400> 148 

Asp Leu Gin Lys Lys Leu Glu Glu Leu Glu Leu Asp Glu 
^5 10 



39 



<210> 149 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 149 
tctaaactg 



<210> 150 
<211> 3 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Peroxisome 
target sequence 

<400> 150 
Ser Lys Leu 
1 
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<210> 151 

<211> 3378 

<212> DNA 

<213> Mus musculus 

<220> 

<221> CDS 

<222> (1) . . (3375) 

<400> 151 

atg gcc gac etc agt ctt gtg gat gcg ttg aca gaa cca cct cca gaa 48 
Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
15 10 15 

att gag gga gaa ata aag cga gac ttc atg get geg ctg gag gca gag 96 
He Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu 
20 25 30 

cce tat gat gac ate gtg gga gaa act gtg gag aaa act gag ttt att 144 
Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He 
35 40 45 

cet etc ctg gat ggt gat gag aaa acc ggg aac tea gag tec aaa aag 192 
Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys 
50 55 60 

aaa ccc tgc tta gac act age eag gtt gaa ggt ate cca tct tet aaa 240 
Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys 
^5 70 75 80 



cca aca etc eta gee aat ggt gat cat gga atg gag ggg aat aac act 
Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr 
85 90 95 



160 



aac get tet gca ttt aca gag ega gac aat cct tea gaa gac agt tac 
Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr 
165 170 175 



286 



gca ggg tct cca act gac ttc ctt gaa gag aga gtg gac tat ccg gat 336 
Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp 
100 105 110 

tat eag age age eag aac tgg cca gaa gat gca age ttt tgt ttc cag 3 84 
Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin 
H5 120 125 

cct eag eaa gtg tta gat act gac cag get gag cce ttt aac gag cac 432 
Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His 
130 135 140 

cgt gat gat ggt ttg gca gat ctg etc ttt gtc tec agt gga ecc aeg 480 
Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr 
145 150 155 



528 



ggt atg ctt ccc tgt gac tea ttt get tec acg get gtt gta tct cag 576 
Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin 
IflO 185 190 
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gag tgg tct gtg gga gcc cca aac tct cca tgt tea gag tec tgt qtc 624 
Glu Trp ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu Ser Cys Val 

200 205 

tec cca gag gtt act ata gaa acc eta cag cca gca aca gag etc tec 672 
Ser Pro Glu Val Thr lie Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser 
2X0 215 220 

aag gca gca gaa gtg gaa tea gtg aaa gag cag ctg cca get aaa gca 720 
Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala 
230 235 240 

ttg gaa acg atg gca gag cag acc act gat gtg gtg cac tct cca tec 768 
Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser 
245 250 255 

aca gac aca aca cca ggc cca gae aca gag gca gca ctg get aaa gac 816 
Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp 
260 265 270 

ata gaa gag ate acc aag cca gat gtg ata ttg gca aat gtc acg cag 
He Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin 



275 280 



265 



aca gga aca gag gca gcc cac get aac aat ate ata ttg ect aca gaa 
Thr Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu 
305 310 



320 



864 



cca tct act gaa teg gat atg ttc ctg gcc cag gac atg gaa eta etc 912 
Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu 
290 295 300 



960 



1008 



1056 



cca gac gaa tct tea ace aag gat gta gca cca cet atg gaa gaa gaa 
Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu 
325 330 . 335 

att gtc cca ggc aat gat acg aca tec eee aaa gaa aca gag aca aca 
He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr 
340 345 350 

ctt cca ata aaa atg gac ttg gca cca ect gag gat gtg tta ett ace 1104 
Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr 
355 360 365 

aaa gaa aca gaa eta gcc cca gee aag ggc atg gtt tea etc tea gaa 1152 
Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu 
370 375 3B0 

r^^ Hf^ ""^^ 9^*^ 9tt cgc tct gca gaa ata ect 1200 

He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro 
390 395 400 

111 H?^ ^^"^ ^""^ 9^9 9tg gtc ctg gca aca 1248 

Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr 
405 410 415 
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gaa gtg gta ctg ccc tea gat ccc ata aca aca ttg aca aag gat gtg 1296 
Glu Val Val Leu Pro Ser Asp Pro lie Thr Thr Leu Thr Lys Asp Val 
420 425 430 

aca etc eee tta gaa gca gag aga ccg ttg gtg acg gae atg act eea 1344 
Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 

tct ctg gaa aca gaa atg acc eta ggc aaa gag aca get eea ccc aca 1392 
Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 

gaa aca aat ttg gge atg gcc aaa gae atg tct oca etc cca gaa tea 1440 
Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

gaa gtg act ctg ggc aag gae gtg gtt ata ett cca gaa aca aag gtg 1488 
Glu val Thr Leu Gly Lys Asp Val Val lie Leu Pro Glu Thr Lys Val 
485 490 495 

get gag ttt aac aat gtg act cca ctt tea gaa gaa gag gta acc tea 1536 
Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser 
500 505 510 

gtc aag gae atg tct ccg tct gca gaa aca gag get ccc ctg get aag 1584 
Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys 
515 520 525 

aat get gat ctg cae tea gga aca gag ctg att gtg gae aac age atg 1632 
Asn Ala Asp Leu His Ser Gly Thr Glu Leu He Val Asp Asn Ser Met 
530 535 540 

get cca gcc tee gat ett gca ctg eee ttg gaa aca aaa gta gca aca 1680 
Ala Pro Ala Ser Asp Leu Ala . Leu Pro Leu Glu Thr Lys Val Ala Thr 
545 550 555 560 

gtt eea att aaa gae aaa gga act gta eag act gaa gaa aaa cca egt 1728 
Val Pro He Lys Asp Lys Gly Thr Val Gin Thr Glu Glu Lys Pro Arg 
565 570 575 

gaa gae tee cag tta gca tct atg cag cae aag gga eag tea aca gta 1776 
Glu Asp Ser Gin Leu Ala Ser Met Gin His Lys Gly Gin Ser Thr Val 
580 585 590 

cct cet tgc acg get tea cca gaa cca gtc aaa get gca gaa caa atg 1824 
Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gin Met 
595 600 605 

tct acc tta cca ata gat gca cct tet cca tta gag aac tta gag eag 1872 
Ser Thr Leu Pro He Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gin 
610 615 620 

aag gaa acg cct ggc age cag cct tct gag cct tgc tea gga gta tec . 1920 
Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu Pro Cys Ser Gly Val Ser 
625 630 635 640 

egg caa gaa gaa gca aag get get gta ggt gtg act gga aat gae ate 1968 
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Arg Gin Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp He 
645 650 655 

act acc ccg cca aac aag gag cca cca cca age cca gaa aag aaa gca 
Thr Thr Pro Pro Asn Lye Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala 
660 665 670 



aag gtt gee gaa aag egg aec tct cca tec aag cct tea tct gee cca 
Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro 
755 760 765 



2016 



aag cct ttg gee acc act caa ect gca aag act tea aca teg aaa gee 2064 
Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys Thr Ser Thr Ser Lys Ala 
675 680 685 

aaa aca cag ccc act tct etc cct aag caa cca get ccc acc acc tct 2112 
Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin Pro Ala Pro Thr Thr Ser 
650 695 700 

9gt ggg ttg aat aaa aaa ccc atg age etc gee tea ggc tea gtg cca 2160 
Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro 
705 710 715 720 

get gee cca eac aaa cgc ect get get gee act get act gee agg ect 2208 
Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 

tec acc eta cct gee aga gae gtg aag cca aag cca att aca gaa get 2256 
Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro He Thr Glu Ala 
740 745 750 



2304 



gcc etc aaa cct gga cct aaa acc ace cca acc gtt tea aaa gee aca 2352 
Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr 
770 775 780 

tct ccc tea act ett gtt tee act gga cca agt agt aga agt cca get 2400 
Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala 
785 790 795 800 

aca act ctg cct aag agg cca acc age ate aag act gag ggg aaa cct 2448 
Thr Thr Leu Pro Lys Arg Pro Thr Ser He Lys Thr Glu Gly Lys Pro 
805 810 815 

get gat gtc aaa agg atg act get aag tct gcc tea get gac ttg agt 2496 
Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 

cgc tea aag acc acc tct gee agt tct gtg aag aga aac acc act ccc 2544 
Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Asn Thr Thr Pro 
835 840 845 

act ggg gca gca ccc cca gca ggg atg act tec act ega gtc aag ccc 2592 
Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

atg tct gca ect age cgc tct tct ggg get ett tct gtg gac aag aag 2640 
Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 
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865 



870 



875 



880 



ccc act tec act aag cct age tec tct get ccc agg gtg age cge ctg 
Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 
885 890 895 



2688 



gee aca act gtt tct gcc cct gac ctg aag agt gtt ege tec aag gte 2736 
Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 

ggc tct aca gaa aae ate aaa cac cag cct gga gga ggc egg gcc aaa 2784 
Gly Ser Thr Glu Asn lie Lys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 

gta gag aaa aaa aca gag gca get ace aca get ggg aag cct gaa cet 2832 
Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 
930 935 940 

aat gca gtc act aaa gca gcc ggc tec att gcg agt gca cag aaa ccg 2880 
Asn Ala Val Thr Lys Ala Ala Gly Ser lie Ala Ser Ala Gin Lys Pro 
950 955 960 

cct get ggg aaa gte cag ata gta tec aaa aaa gtg age tac agt cat 2928 
Pro Ala Gly Lys Val Gin lie Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 975 

att caa tec aag tgt gtt tec aag gae aat att aag cat gtc cet gga 2976 
He Gin Ser Lys Cys Val Ser Lys Asp Asn He Lys His Val Pro Gly 
980 985 990 

tgt ggc aat gtt cag att eag aac aag aaa gtg gac ata tec aag gtc 3024 
Cys Gly Asn Val Gin He Gin Asn Lys Lys Val Asp He Ser Lys Val 
995 1000 1005 

tec tec aag tgt ggg tec aaa get aat ate aag cac aag cct ggt gga 3 072 
Ser Ser Lys Cys Gly Ser Lys Ala Asn He Lys His Lys Pro Gly Gly 
1010 1015 1020 



gga gat gtc aag att gaa agt cag aag ttg aac ttc aag gag aag gcc 
Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu Lys Ala 
1025 1030 1035 



1040 



3120 



caa gcc aaa gtg gga tec ett gat aac gtt ggc cac ttt cct gca gga 
Gin Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly 
1045 1050 1055 

ggt gee gtg aag act gag ggc ggt ggc agt gag gee ctt ccg tgt cca 
Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro 
1060 1065 1070 



3168 



3216 



ggc ccc ccc get ggg gag gag cca gtc ate cet gag get gcg cct gac 3264 
Gly Pro Pro Ala Gly Glu Glu Pro Val He Pro Glu Ala Ala Pro Asp 
1075 1080 1085 

cgt ggc gcc cct act tea gee agt ggc etc agt ggc cac ace ace ctg 3312 
Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu 
1090 1095 1100 
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tea ggg ggt ggt gac caa agg gag ccc cag acc ttg gac age eag ate 3360 
Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin Thr Leu Asp Ser Gin He 
1105 1110 1115 1120 

cag gag aca age ate taa 3378 
Gin Glu Thr Ser He 
1125 



<210> 152 
<211> 1125 
<212> PRT 

<213> Mus imisculus 
<400> 152 

Met Ala Asp Leu Ser Leu Val Asp Ala Leu Thr Glu Pro Pro Pro Glu 
15 10 15 

He Glu Gly Glu He Lys Arg Asp Phe Met Ala Ala Leu Glu Ala Glu 
20 25 30 

Pro Tyr Asp Asp He Val Gly Glu Thr Val Glu Lys Thr Glu Phe He 
35 40 45 

Pro Leu Leu Asp Gly Asp Glu Lys Thr Gly Asn Ser Glu Ser Lys Lys 
50 55 60 

Lys Pro Cys Leu Asp Thr Ser Gin Val Glu Gly He Pro Ser Ser Lys 
65 70 75 80 

Pro Thr Leu Leu Ala Asn Gly Asp His Gly Met Glu Gly Asn Asn Thr 
85 90 95 

Ala Gly Ser Pro Thr Asp Phe Leu Glu Glu Arg Val Asp Tyr Pro Asp 
100 105 110 

Tyr Gin Ser Ser Gin Asn Trp Pro Glu Asp Ala Ser Phe Cys Phe Gin 
115 120 125 

Pro Gin Gin Val Leu Asp Thr Asp Gin Ala Glu Pro Phe Asn Glu His 
130 135 140 

Arg Asp Asp Gly Leu Ala Asp Leu Leu Phe Val Ser Ser Gly Pro Thr 
145 150 155 160 

Asn Ala Ser Ala Phe Thr Glu Arg Asp Asn Pro Ser Glu Asp Ser Tyr 
165 170 175 

Gly Met Leu Pro Cys Asp Ser Phe Ala Ser Thr Ala Val Val Ser Gin 
180 185 190 

Glu Trp Ser Val Gly Ala Pro Asn Ser Pro Cys Ser Glu . Ser Cys Val 
195 200 205 

Ser Pro Glu Val Thr He Glu Thr Leu Gin Pro Ala Thr Glu Leu Ser 
210 215 220 
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Lys Ala Ala Glu Val Glu Ser Val Lys Glu Gin Leu Pro Ala Lys Ala 
225 230 235 240 

Leu Glu Thr Met Ala Glu Gin Thr Thr Asp Val Val His Ser Pro Ser 
245 250 255 

Thr Asp Thr Thr Pro Gly Pro Asp Thr Glu Ala Ala Leu Ala Lys Asp 
260 265 270 

He Glu Glu He Thr Lys Pro Asp Val He Leu Ala Asn Val Thr Gin 
275 280 285 

Pro Ser Thr Glu Ser Asp Met Phe Leu Ala Gin Asp Met Glu Leu Leu 
290 295 300 

Thr Gly Thr Glu Ala Ala His Ala Asn Asn He He Leu Pro Thr Glu 
305 310 315 320 

Pro Asp Glu Ser Ser Thr Lys Asp Val Ala Pro Pro Met Glu Glu Glu 
325 330 335 

He Val Pro Gly Asn Asp Thr Thr Ser Pro Lys Glu Thr Glu Thr Thr 
340 345 350 

Leu Pro He Lys Met Asp Leu Ala Pro Pro Glu Asp Val Leu Leu Thr 
355 360 365 

Lys Glu Thr Glu Leu Ala Pro Ala Lys Gly Met Val Ser Leu Ser Glu 
370 375 380 

He Glu Glu Ala Leu Ala Lys Asn Asp Val Arg Ser Ala Glu He Pro 
385 390 395 400 

Val Ala Gin Glu Thr Val Val Ser Glu Thr Glu Val Val Leu Ala Thr 
405 410 . 415 

Glu Val Val Leu Pro Ser Asp Pro He Thr Thr Leu Thr Lys Asp Val 
420 425 430 

Thr Leu Pro Leu Glu Ala Glu Arg Pro Leu Val Thr Asp Met Thr Pro 
435 440 445 

Ser Leu Glu Thr Glu Met Thr Leu Gly Lys Glu Thr Ala Pro Pro Thr 
450 455 460 

Glu Thr Asn Leu Gly Met Ala Lys Asp Met Ser Pro Leu Pro Glu Ser 
465 470 475 480 

Glu Val Thr Leu Gly Lys Asp Val Val He Leu Pro Glu Thr Lys Val 
485 490 495 

Ala Glu Phe Asn Asn Val Thr Pro Leu Ser Glu Glu Glu Val Thr Ser 
500 505 510 



Val Lys Asp Met Ser Pro Ser Ala Glu Thr Glu Ala Pro Leu Ala Lys 
515 520 525 
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Asn Ala Asp Leu His Ser Gly Thr Glu Leu He Val Asp Asn Ser Met 
530 535 540 

Ala Pro Ala Ser Asp Leu Ala Leu Pro Leu Glu Thr Lys Val Ala Thr 
545 550 555 560 

Val Pro He Lys Asp Lys Gly Thr Val Gin Thr Glu Glu Lys Pro Arg 
565 570 575 

Glu Asp Ser Gin Leu Ala Ser Met Gin His Lys Gly Gin Ser Thr Val 
580 585 590 

Pro Pro Cys Thr Ala Ser Pro Glu Pro Val Lys Ala Ala Glu Gin Met 
595 600 605 

Ser Thr Leu Pro He Asp Ala Pro Ser Pro Leu Glu Asn Leu Glu Gin 
610 615 620 

Lys Glu Thr Pro Gly Ser Gin Pro Ser Glu Pro Cys Ser Gly Val Ser 
625 630 635 640 

Arg Gin Glu Glu Ala Lys Ala Ala Val Gly Val Thr Gly Asn Asp He 
645 650 655 

Thr Thr Pro Pro Asn Lys Glu Pro Pro Pro Ser Pro Glu Lys Lys Ala 
660 665 670 

Lys Pro Leu Ala Thr Thr Gin Pro Ala Lys Thr Ser Thr Ser Lys Ala 
675 680 685 

Lys Thr Gin Pro Thr Ser Leu Pro Lys Gin Pro Ala Pro Thr Thr Ser 
690 695 700 

Gly Gly Leu Asn Lys Lys Pro Met Ser Leu Ala Ser Gly Ser Val Pro 
705 710 .715 720 

Ala Ala Pro His Lys Arg Pro Ala Ala Ala Thr Ala Thr Ala Arg Pro 
725 730 735 

Ser Thr Leu Pro Ala Arg Asp Val Lys Pro Lys Pro He Thr Glu Ala 
740 745 750 

Lys Val Ala Glu Lys Arg Thr Ser Pro Ser Lys Pro Ser Ser Ala Pro 
755 760 765 

Ala Leu Lys Pro Gly Pro Lys Thr Thr Pro Thr Val Ser Lys Ala Thr 
770 775 780 

Ser Pro Ser Thr Leu Val Ser Thr Gly Pro Ser Ser Arg Ser Pro Ala 
785 790 795 800 

Thr Thr Leu Pro Lys Arg Pro Thr Ser He Lys Thr Glu Gly Lys Pro 
805 810 815 

Ala Asp Val Lys Arg Met Thr Ala Lys Ser Ala Ser Ala Asp Leu Ser 
820 825 830 
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Arg Ser Lys Thr Thr Ser Ala Ser Ser Val Lys Arg Asn Thr Thr Pro 
835 840 845 

Thr Gly Ala Ala Pro Pro Ala Gly Met Thr Ser Thr Arg Val Lys Pro 
850 855 860 

Met Ser Ala Pro Ser Arg Ser Ser Gly Ala Leu Ser Val Asp Lys Lys 
870 875 880 

Pro Thr Ser Thr Lys Pro Ser Ser Ser Ala Pro Arg Val Ser Arg Leu 
885 890 895 

Ala Thr Thr Val Ser Ala Pro Asp Leu Lys Ser Val Arg Ser Lys Val 
900 905 910 

Gly Ser Thr Glu Asn He Lys His Gin Pro Gly Gly Gly Arg Ala Lys 
915 920 925 

Val Glu Lys Lys Thr Glu Ala Ala Thr Thr Ala Gly Lys Pro Glu Pro 
930 935 940 

Asn Ala Val Thr Lys Ala Ala Gly Ser He Ala Ser Ala Gin Lys Pro 
950 955 960 

Pro Ala Gly Lys Val Gin He Val Ser Lys Lys Val Ser Tyr Ser His 
965 970 975 

He Gin Ser Lys Cys Val Ser Lys Asp Asn He Lys His Val Pro Gly 
980 985 990 

Cys Gly Asn Val Gin He Gin Asn Lys Lys Val Asp He Ser Lys Val 
995 1000 1005 

Ser Ser Lys Cys Gly Ser Lys Ala Asn He Lys His Lys Pro Gly Gly 
1010 1015 1020 

Gly Asp Val Lys He Glu Ser Gin Lys Leu Asn Phe Lys Glu Lys Ala 
^025 1030 1035 1040 

Gin Ala Lys Val Gly Ser Leu Asp Asn Val Gly His Phe Pro Ala Gly 
1045 1050 1055 

Gly Ala Val Lys Thr Glu Gly Gly Gly Ser Glu Ala Leu Pro Cys Pro 
1060 1065 1070 

Gly Pro Pro Ala Gly Glu Glu Pro Val He Pro Glu Ala Ala Pro Asp 
1075 1080 1085 

Arg Gly Ala Pro Thr Ser Ala Ser Gly Leu Ser Gly His Thr Thr Leu 
1090 1095 1100 

Ser Gly Gly Gly Asp Gin Arg Glu Pro Gin Thr Leu Asp Ser Gin He 
1110 1115 1120 

Gin Glu Thr Ser He 
1125 
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<210> 153 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 153 

tcatcatccg gagctggagc cggagctggc cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 



<210> 154 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 154 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtacttgc tacttcatc 99 



<210> 155 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 155 

tcatcatccg gagctgga 



<210> 156 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
- <220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 156 

gaagaaggat ccggcact 
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<210> 157 
<211> 96 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
ol igonuc 1 eo t i de 

<400> 157 

tcatcatccg gaagaaggaa acgacaaaag cgatcggctg ttaaatctga aggaaagaga 60 
aagtgtgacg aagttgatgg aattgatgaa gtagca 96 

<210> 158 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 158 

tcatcatccg gaagaagg 18 

<210> 159 
<211> 60 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 159 

tcatcatccg gaagaaggaa acgacaaaag cgatcgacaa gacttgttga aattgacaac 60 

<210> 160 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 160 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactgtt gtcaatttc 99 
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<210> 161 
<211> 84 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 161 

tcatcatccg gaagaaggaa acgacaaaag cgatcgtatc aaaaaggaat accagttgaa 60 
acagacagcg aagagcaacc ttat 84 

<210> 162 
<211> 99 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 162 

gaagaaggat ccggcacttg ggggtgtaga atgaacaccc tccaagctga gcttgcacag 60 
gatttcgtgg acagtagaca tagtactata aggttgctc 99 

<210> 163 
<211> 60 
<212> DNA 

<213> Artificial Secpience 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 163 

tcatcatccg gaagaaaacg tatacgtact tacctcaagt cctgcaggcg gatgaaaaga 60 

<210> 164 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 164 

gaagaacgat cgagtaaggt gggaaggaat aggtcgagac atctcaaaac cacttctttt 60 
cat 



128 



wo 00/50872 



PCTAJSOO/04794 



<210> 165 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 165 

tcatcatccg gaagaaaa 

<210> 166 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<400> 166 

gaagaacgat cgagtaag 

<210> 167 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1, 4, 5 
siibstrate recognition sequence 

<400> 167 
ttagaacatg acaa 

<210> 168 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Caspase-1,4, 5 
substrate recognition sequence 

<400> 168 
Leu Glu His Asp 
1 

<210> 169 
<211> 1380 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: GFP-HSP27 

<220> 

<221> CDS 

<222> (1) . . (1380) 

<400> 169 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Iteu 
15 10 15 

gtc gag ctg gac ggc gae gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag gge gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

tgc acc acc gge aag ctg ece gtg ccc tgg eee acc etc gtg aee aec 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg ace tac ggc gtg cag tgc ttc age cgc tac ccc gac cae atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg eee gaa gge tac gtc eag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

ege acc ate ttc tte aag gae gac ggc aac tac aag aec ege gee gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag tte gag ggc gac acc ctg gtg aac ege ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gae tte aag gag gac gge aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aae age cac aac gte tat ate atg gee gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cae aac ate gag gac ggc age 528 
Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc eee ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 



age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys J\rg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



gtg ace gee gee ggg ate act etc ggc atg gac gag ctg tac aag tec 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



720 



gga etc aga tet cga gcg gcg tec aga gca gag tea gee age atg acc 768 
Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 



gag cgc cgc gtc ccc ttc teg etc ctg egg ggc ccc age tgg gac ccc 816 
Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

ttc cgc gac tgg tac ccg eat age cgc etc ttc gac cag gee ttc ggg 864 
Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gin Ala Phe Gly 
275 280 285 

ctg ccc egg ctg ccg gag gag tgg teg cag tgg tta ggc ggc age age 912 
Leu Pro Arg Leu Pro Glu Glu Trp Ser Gin Trp Leu Gly Gly Ser Ser 
290 295 300 



tgg cca ggc tac gtg cgc ccc ctg ccc ccc gcc gcc ate gag age ccc 960 
Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala He Glu Ser Pro 
305 310 315 320 

gca gtg gcc gcg ccc gcc tac age cgc gcg etc age egg caa etc age 1008 
Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gin Leu Ser 
325 330 335 

age ggg gtc teg gag ate egg cac act gcg gac cgc tgg cgc gtg tec 1056 
Ser Gly Val Ser Glu He Arg His Thr Ala Asp Arg Trp Arg Val Ser 
340 345 350 



Ctg gat gtc aac cac ttc gcc ccg gac gag ctg acg gtc aag acc aag 1104 
Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys 
355 360 365 

gat ggc gtg gtg gag ate acc ggc aag cac gag gag egg cag gac gag 1152 
Asp Gly Val Val Glu He Thr Gly Lys His Glu Glu Arg Gin Asp Glu 
370 375 380 

cat ggc tac ate tec egg tge ttc acg egg aaa tac acg ctg ccc ccc 1200 
His Gly Tyr He Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro 
385 390 395 400 

get gtg gac ccc acc caa gtt tee tec tec ctg tec cet gag ggc aca 1248 
Gly Val Asp Pro Thr Gin Val Ser Ser Ser Leu Ser Pro Glu Gly Thr 
405 410 415 

Ctg ace gtg gag gee ccc atg ccc aag eta gcc acg cag tec aac gag 1296 
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Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gin Ser Asn Glu 
420 425 430 

ate ace ate eea gtc ace ttc gag teg egg gcc cag ctt ggg ggc eca 1344 
lie Thr lie Pro Val Thr Phe Glu Ser Arg Ala Gin Leu Gly Gly Pro 
435 440 445 

gaa get gca aaa tec gat gag act gcc gcc aag taa 1380 
Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 

450 455 460 



<210> 170 
<211> 459 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPP-HSP27 
<400> 170 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 
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Pro Val Leu lieu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lya Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Ala Ser Arg Ala Glu Ser Ala Ser Met Thr 
245 250 255 

Glu Arg Arg Val Pro Phe Ser Leu Leu Arg Gly Pro Ser Trp Asp Pro 
260 265 270 

Phe Arg Asp Trp Tyr Pro His Ser Arg Leu Phe Asp Gin Ala Phe Gly 
275 280 285 

Leu Pro Arg Leu Pro Glu Glu Trp Ser Gin Trp Leu Gly Gly Ser Ser 
290 295 300 

Trp Pro Gly Tyr Val Arg Pro Leu Pro Pro Ala Ala lie Glu Ser Pro 
305 310 315 320 

Ala Val Ala Ala Pro Ala Tyr Ser Arg Ala Leu Ser Arg Gin Leu Ser 
325 330 335 

Ser Gly Val Ser Glu lie Arg His Thr Ala Asp Arg Trp Arg Val Ser 
340 345 350 

Leu Asp Val Asn His Phe Ala Pro Asp Glu Leu Thr Val Lys Thr Lys 
355 360 365 

Asp Gly Val Val Glu lie Thr Gly Lys His Glu Glu Arg Gin Asp Glu 
370 375 380 

His Gly Tyr lie Ser Arg Cys Phe Thr Arg Lys Tyr Thr Leu Pro Pro 
385 390 395 400 

Gly Val Asp Pro Thr Gin Val Ser Ser Ser Leu Ser Pro Glu Gly Thr 
405 410 415 

Leu Thr Val Glu Ala Pro Met Pro Lys Leu Ala Thr Gin Ser Asn Glu 
420 425 430 

He Thr He Pro Val Thr Phe Glu Ser Arg Ala Gin Leu Gly Gly Pro 
435 440 445 

Glu Ala Ala Lys Ser Asp Glu Thr Ala Ala Lys 
450 455 



<210> 171 
<211> 2823 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: GFP-HSP70 

<220> 

<221> CDS 

<222> (1)..{2823) 

<400> 171 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Aan Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg ace ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 . 

tgc acc acc ggc aag ctg eec gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg ace tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac ace ctg gtg aac cgc ate gag ctg aag ggc 3 84 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
lis 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac eac tac ctg age acc cag tee gee ctg 624 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act etc gge atg gae gag etg tac aag tee 720 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga atg teg gtg gtg gge ata gae etg gge ttc cag age tgc tac gtc 768 
Gly Met Ser Val Val Gly He Asp Leu Gly Phe Gin Ser Cys Tyr Val 
245 250 255 

get gtg gcc cgc gcc gge gge ate gag act ate get aat gag tat age 816 
Ala Val Ala Arg Ala Gly Gly He Glu Thr He Ala Asn Glu Tyr Ser 
260 265 270 

gae cgc tgc acg ccg get tgc att tet ttt ggt ect aag aat cgt tea 864 
Asp Arg Cys Thr Pro Ala Cys He Ser Phe Gly Pro Lys Asn Arg Ser 
275 280 285 

att gga gca gca get aaa age cag gta att tet aat gea aag aac aca 912 
He Gly Ala Ala Ala Lys Ser Gin Val He Ser Asn Ala Lys Asn Thr 
290 295 300 

gtc caa gga ttt aaa aga ttc cat gge ega gca ttc tet gat cca ttt 960 
Val Gin Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe 
305 310 315 320 

gtg gag gca gaa aaa tet aac ctt gca tat gat att gtg cag tgg cct 1008 
Val Glu Ala Glu Lys Ser Asn lieu Ala Tyr Asp He Val Gin Trp Pro 
325 330 335 

aca gga tta aca ggt ata aag gtg aca tat .atg gag gaa gag ega aat 1056 
Thr Gly Leu Thr Gly He Lys Val Thr Tyr Met Glu Glu Glu Arg Asn 
340 345 350 

ttt acc act gag caa gtg act gcc atg ctt ttg tec aaa etg aag gag 1104 
Phe Thr Thr Glu Gin Val Thr Ala Met Leu Leu Ser Lys Leu Lys Glu 
355 360 365 

aca gee gaa agt gtt ctt aag aag cct gta gtt gac tgt gtt gtt teg 1152 
Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser 
370 375 380 

gtt ect tgt ttc tat act gat gca gaa aga ega tea gtg atg gat gca 1200 
Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala 
385 390 395 400 

aca cag att get ggt ctt aat tgc ttg ega tta atg aat gaa acc act 1246 
Thr Gin He Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr 
405 410 415 

gca gtt get ctt gca tat gga ate tat aag cag gat ctt cct cgc tta 1296 
Ala Val Ala Leu Ala Tyr Gly He Tyr Lys Gin Asp Leu Pro Arg Leu 
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420 425 430 

gaa gag aaa cca aga aat gta gtt ttt gta gac atg ggc cac tct get 1344 
Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly Hie Ser Ala 
435 440 445 

tat caa gtt tct gta tgt gca ttt aat aga gga aaa ctg aaa gtt ctg 1392 
Tyr Gin Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu 
450 455 460 

gcc act gca ttt gac acg aca ttg gga ggt aga aaa ttt gat gaa gtg 1440 
Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val 
465 470 475 480 

tta gta aat cac ttc tgt gaa gaa ttt ggg aag aaa tac aag eta gac 1488 
Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp 
485 490 495 

att aag tec aaa ate cgt gca tta tta cga etc tct cag gag tgt gag 1536 
He Lys Ser Lys He Arg Ala Leu Leu Arg Leu Ser Gin Glu Cys Glu 
500 505 510 

aaa etc aag aaa ttg atg agt gca aat get tea gat etc cet ttg age 1584 
Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser 
515 520 525 

att gaa tgt ttt atg aat gat gtt gat gta tct gga act atg aat aga 1632 
He Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg 
530 535 540 

ggc aaa ttt ctg gag atg tgc aat gat etc tta get aga gtg gag cca 1680 
Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro 
545 550 555 560 

cca ctt cgt agt gtt ttg gaa caa acc aag tta aag aaa gaa gat att 1728 
Pro Leu Arg Ser Val Leu Glu Gin Thr Lys Leu Lys Lys Glu Asp He 
565 570 575 

tat gca gtg gag ata gtt ggt ggt get aca cga ate cet gcg gta aaa 1776 
Tyr Ala Val Glu He Val Gly Gly Ala Thr Arg He Pro Ala Val Lys 
580 585 590 

gag aag ate age aaa ttt ttc ggt aaa gaa ctt agt aca aca tta aat 1824 
Glu Lys He Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn 
595 600 605 

get gat gaa get gte act cga ggc tgt gca ttg cag tgt gcc ate tta 1872 
Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gin Cys Ala He Leu 
610 615 620 

teg cet get ttc aaa gte aga gaa ttt tct ate act gat gta gta cca 1920 
Ser Pro Ala Phe Lys Val Arg Glu Phe Ser He Thr Asp Val Val Pro 
625 630 635 640 

tat cca ata tct ctg aga tgg aat tct cca get gaa gaa ggg tea agt 1968 
Tyr Pro He Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser 
645 650 655 
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gac tgt gaa gtc ttt t:cc aaa aat cat get get cct ttc tct aaa gtt 2016 
Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

ctt aca ttt tat aga aag gaa cct ttc act ctt gag gcc tac tac age 2064 
Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 

tct cct cag gat ttg ccc tat cca gat cct get ata get cag ttt tea 2112 
Ser Pro Gin Asp Leu Pro Tyr Pro Asp Pro Ala lie Ala Gin Phe Ser 
690 695 700 

gtt cag aaa gtc act cct cag tct gat ggc tec agt tea aaa gtg aaa 2160 
Val Gin Lys Val Thr Pro Gin Ser Asp Gly Ser Ser Ser Lys Val Lys 
705 710 715 720 

gtc aaa gtt cga gta aat gtc cat ggc att ttc agt gtg tec agt gea 2208 
Val Lys Val Arg Val Asn Val His Gly He Phe Ser Val Ser Ser Ala 
725 730 735 

tet tta gtg gag gtt cac aag tct gag gaa aat gag gag cca atg gaa 2256 
Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu 
740 745 750 

aca gat cag aat gca aag gag gaa gag aag atg caa gtg gac cag gag 23 04 
Thr Asp Gin Asn Ala Lys Glu Glu Glu Lys Wet Gin Val Asp Gin Glu 
755 760 765 

gaa cca eat gtt gaa gag caa cag cag cag aca cca gca gaa aat aag 2352 
Glu Pro His Val Glu Glu Gin Gin Gin Gin Thr Pro Ala Glu Asn Lys 
770 775 780 

gca gag tct gaa gaa atg gag acc tct caa get gga tec aag gat aaa 24 00 
Ala Glu Ser Glu Glu Met Glu Thr Ser Gin Ala Gly Ser Lys Asp Lys 
785 790 .795 800 

aag atg gac caa cca ccc caa tge caa gaa ggc aaa agt gaa gac cag 2448 
Lys Met Asp Gin Pro Pro Gin Cys Gin Glu Gly Lys Ser Glu Asp Gin 
805 810 815 

tac tgt gga cct gcc aat cga gaa tea get ata tgg cag ata gac aga 2496 
Tyr Cys Gly Pro Ala Asn Arg Glu Ser Ala He Trp Gin He Asp Arg 
820 825 630 

gag atg etc aac ttg tac att gaa aat gag ggt aag atg ate atg cag 2544 
Glu Met Leu Asn Leu Tyr He Glu Asn Glu Gly Lys Met He Met Gin 
835 840 845 

gat aaa ctg gag aag gag egg aat gat get aag aac gca gtg gag gaa 2592 
Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu 
850 855 860 

tat gtg tat gaa atg aga gac aag ctt agt ggt gaa tat gag aag ttt 2640 
Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe 
865 870 875 880 
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gtg agt gaa gat gat cgt aac agt ttt act ttg aaa ctg gaa gat act 2688 
Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr 
885 890 895 

gaa aat tgg ttg tat gag gat gga gaa gac cag cca aag caa gtt tat 2736 
Glu Asn Trp Leu Tyr Glu Asp Gly Glu Asp Gin Pro Lys Gin Val Tyr 
900 905 910 

gtt gat aag ttg get gaa tta aaa aat eta ggt caa cct att aag ata 2784 
Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gin Pro lie Lys lie 
915 920 925 

cgt ttc cag gaa tct gaa gaa cga cca aat tat ttg aag 2823 
Arg Phe Gin Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys 
930 935 940 



<210> 172 
<211> 941 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSP70 
<400> 172 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp .Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
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165 



170 



175 



Val Gin l*eu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Met Ser Val Val Gly lie Asp Leu Gly Phe Gin Ser Cys Tyr Val 
245 250 255 

Ala Val Ala Arg Ala Gly Gly lie Glu Thr He Ala Asn Glu Tyr Ser 
260 265 270 

Asp Arg Cys Thr Pro Ala Cys He Ser Phe Gly Pro Lys Asn Arg Ser 
275 280 285 

lie Gly Ala Ala Ala Lys Ser Gin Val He Ser Asn Ala Lys Asn Thr 
290 295 300 

Val Gin Gly Phe Lys Arg Phe His Gly Arg Ala Phe Ser Asp Pro Phe 
305 310 315 320 

Val Glu Ala Glu Lys Ser Asn Leu Ala Tyr Asp He Val Gin Trp Pro 
325 330 335 

Thr Gly Leu Thr Gly He Lys Val Thr Tyr Met Glu Glu Glu Arg Asn 
340 345 350 

Phe Thr Thr Glu Gin Val Thr Ala Met Leu ieu Ser Lys Leu Lys Glu 
355 360 365 

Thr Ala Glu Ser Val Leu Lys Lys Pro Val Val Asp Cys Val Val Ser 
370 375 380 

Val Pro Cys Phe Tyr Thr Asp Ala Glu Arg Arg Ser Val Met Asp Ala 
385 390 395 400 

Thr Gin He Ala Gly Leu Asn Cys Leu Arg Leu Met Asn Glu Thr Thr 
405 410 415 

Ala Val Ala Leu Ala Tyr Gly He Tyr Lys Gin Asp Leu Pro Arg Leu 
420 425 430 

Glu Glu Lys Pro Arg Asn Val Val Phe Val Asp Met Gly His Ser Ala 
435 440 445 

Tyr Gin Val Ser Val Cys Ala Phe Asn Arg Gly Lys Leu Lys Val Leu 
450 455 460 



Ala Thr Ala Phe Asp Thr Thr Leu Gly Gly Arg Lys Phe Asp Glu Val 
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465 



470 



475 



480 



Leu Val Asn His Phe Cys Glu Glu Phe Gly Lys Lys Tyr Lys Leu Asp 
485 490 495 

lie Lys Ser Lys lie Arg Ala Leu Leu Arg Leu Ser Gin Glu Cys Glu 
500 505 510 

Lys Leu Lys Lys Leu Met Ser Ala Asn Ala Ser Asp Leu Pro Leu Ser 
515 520 525 

lie Glu Cys Phe Met Asn Asp Val Asp Val Ser Gly Thr Met Asn Arg 
530 535 540 

Gly Lys Phe Leu Glu Met Cys Asn Asp Leu Leu Ala Arg Val Glu Pro 
545 550 555 560 

Pro Leu Arg Ser Val Leu Glu Gin Thr Lys Leu Lys Lys Glu Asp lie 
565 570 575 

Tyr Ala Val Glu He Val Gly Gly Ala Thr Arg He Pro Ala Val Lys 
580 585 590 

Glu Lys He Ser Lys Phe Phe Gly Lys Glu Leu Ser Thr Thr Leu Asn 
595 600 605 

Ala Asp Glu Ala Val Thr Arg Gly Cys Ala Leu Gin Cys Ala He Leu 
610 615 620 

Ser Pro Ala Phe Lys Val Arg Glu Phe Ser He Thr Asp Val Val Pro 
625 630 635 640 

Tyr Pro He Ser Leu Arg Trp Asn Ser Pro Ala Glu Glu Gly Ser Ser 
645 650 655 

Asp Cys Glu Val Phe Ser Lys Asn His Ala Ala Pro Phe Ser Lys Val 
660 665 670 

Leu Thr Phe Tyr Arg Lys Glu Pro Phe Thr Leu Glu Ala Tyr Tyr Ser 
675 680 685 

Ser Pro Gin Asp Leu Pro Tyr Pro Asp Pro Ala He Ala Gin Phe Ser 
690 695 700 

Val Gin Lys Val Thr Pro Gin Ser Asp Gly Ser Ser Ser Lys Val Lys 
705 710 715 720 

Val Lys Val Arg Val Asn Val His Gly He Phe Ser Val Ser Ser Ala 
725 730 735 

Ser Leu Val Glu Val His Lys Ser Glu Glu Asn Glu Glu Pro Met Glu 
740 745 750 

Thr Asp Gin Asn Ala Lys Glu Glu Glu Lys Met Gin Val Asp Gin Glu 
755 760 765 



Glu Pro His Val Glu Glu Gin Gin Gin Gin Thr Pro Ala Glu Asn Lys 
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770 



775 



780 



Ala Glu Ser Glu Glu Met Glu Thr Ser Gin Ala Gly Ser Lys Asp Lys 
785 790 795 800 

Lys Met Asp Gin Pro Pro Gin Cys Gin Glu Gly Lys Ser Glu Asp Gin 
805 810 815 

Tyr Cys Gly Pro Ala Asn Arg Glu Ser Ala He Trp Gin He Asp Arg 
820 825 830 

Glu Met Leu Asn Leu Tyr He Glu Asn Glu Gly Lys Met He Met Gin 
835 640 845 

Asp Lys Leu Glu Lys Glu Arg Asn Asp Ala Lys Asn Ala Val Glu Glu 
850 855 860 



Tyr Val Tyr Glu Met Arg Asp Lys Leu Ser Gly Glu Tyr Glu Lys Phe 
865 870 875 880 

Val Ser Glu Asp Asp Arg Asn Ser Phe Thr Leu Lys Leu Glu Asp Thr 
885 890 895 

Glu Asn Trp Leu Tyr Glu Asp Gly Glu Asp Gin Pro Lys Gin Val Tyr 
900 905 910 

Val Asp Lys Leu Ala Glu Leu Lys Asn Leu Gly Gin Pro He Lys He 
915 920 925 

Arg Phe Gin Glu Ser Glu Glu Arg Pro Asn Tyr Leu Lys 
930 935 940 



<210> 173 

<211> 2674 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSC70 

<220> 

<221> CDS 

<222> (1)..(2673) 

<400> 173 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 
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tgc acc acc ggc aag etg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gee gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag etg aag ggc 38^ 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gee gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528. 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac ace ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac etg age acc cag tec gee etg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag Cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

gtg acc gee gee ggg ate act etc ggc atg gac gag ctg tac aag tec 720 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

gga etc aga tct atg tee aag gga cct gca gtt ggt att gat ctt ggc 768 
Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly He Asp Leu Gly 
245 250 255 

ace acc tac tct tgt gtg ggt gtt ttc cag cac gga aaa gtc gag ata 816 
Thr Thr Tyr Ser Cys Val Gly Val Phe Gin His Gly Lys Val Glu He 
260 265 270 
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att gcc aat gat cag gga aac cga acc act cca age tat gtc gcc ttt 
lie Ma Asn Asp Gin Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe 



864 



275 280 



285 



acg gac act gaa egg ttg ate ggt gat gcc gca aag aat caa gtt gca 912 
Thr Asp Thr Glu Arg Leu lie Gly Asp Ala Ala Lys Asn Gin Val Ala 
290 295 300 

atg aac ccc aec aae aea gtt ttt gat gcc aaa cgt etg att gga cgc 960 
Met Asn Pro Thr Asn Thr Val Phe Asp Ala Lys Arg Leu He Gly Ara 
310 315 320 

aga ttt gat gat get gtt gtc eag tct gat atg aaa eat tgg ccc ttt 1008 
Arg Phe Asp Asp Ala Val Val Gin Ser Asp Met Lys His Trp Pro Phe 
325 330 335 

atg gtg gtg aat gat get ggc agg cec aag gte caa gta gaa tac aag 1056 
Met Val Val Asn Asp Ala Gly Arg Pro Lys Val Gin Val Glu Tyr Lys 
340 345 350 

gga gag acc aaa age ttc tat cca gag gag gtg tct tct atg gtt etg 1104 
Gly Glu Thr Lys Ser Phe Tyr Pro Glu Glu Val Ser Ser Met Val Leu 
355 360 365 

aca aag atg aag gaa att gca gaa gcc tac ett ggg aag act gtt acc 1152 
Thr Lys Met Lys Glu He Ala Glu Ala Tyr Leu Gly Lys Thr Val Thr 
370 375 380 

aat get gtg gte aea gtg cca get tac ttt aat gae tct cag cgt cag 1200 
Asn Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Ser Gin Arg Gin 
390 395 400 

get aec aaa gat get gga act att get ggt etc aat gta ett aga att 1248 
Ala Thr Lys Asp Ala Gly Thr He Ala Gly Leu Asn Val Leu Arg He 
405 410 415 

att aat gag cca act get get get att get tac ggc tta gac aaa aag 1296 
He Asn Glu Pro Thr Ala Ala Ala He Ala Tyr Gly Leu Asp Lys Lys 
420 425 430 

gtt gga gca gaa aga aac gtg etc ate ttt gac etg gga ggt ggc act 1344 
Val Gly Ala Glu Arg Asn Val Leu He Phe Asp Leu Gly Gly Gly Thr 
435 440 445 

ttt gat gtg tea ate etc act att gag gat gga ate ttt gag gtc aag 1392 
Phe Asp Val Ser He Leu Thr He Glu Asp Gly He Phe Glu Val Lys 
450 455 460 

tct aca get gga gae acc cac ttg ggt gga gaa gat ttt gac aae cga 1440 
Ser Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg 
470 475 480 

atg gte aac eat ttt att get gag ttt aag cgc aag cat aag aag gac 1488 
Met Val Asn His Phe He Ala Glu Phe Lys Arg Lys His Lys Lys Asp 
485 490 495 

ate agt gag aac aag aga get gta aga cgc etc cgt act get tgt gaa 1536 
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He Ser Glu Asn hya Arg Ala Val Arg Arg Leu Arg Thr Ala Cye Glu 
500 505 510 

cgt get aag cgt acc etc tct tec age acc cag gee agt att gag ate 1584 
Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gin Ala Ser He Glu He 
515 520 525 

gat tct etc tat gaa gga ate gac tte tat acc tee att acc cgt gcc 1632 
Asp Ser Leu Tyr Glu Gly He Asp Phe Tyr Thr Ser He Thr Arg Ala 
530 535 540 

cga ttt gaa gaa ctg aat get gac ctg ttc cgt ggc ace etg gac eea 1680 
Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro 
5« 550 555 560 

gta gag aaa gee ctt cga gat gcc aaa eta gac aag tea cag att eat 1728 
Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gin He His 
565 570 575 

gat att gte ctg gtt ggt ggt tct act cgt ate cec aag att cag aag 1776 
Asp lie Val Leu Val Gly Gly Ser Thr Arg He Pro Lys He Gin Lys 
580 585 590 

ctt etc caa gac ttc tte aat gga aaa gaa ctg aat aag age ate aac 1824 
Leu Leu Gin Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser He Asn 
595 600 605 

eet gat gaa get gtt get tat ggt gca get gte cag gea gcc ate ttg 1872 
Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gin Ala Ala He Leu 
610 615 620 

tct gga gac aag tct gag aat gtt caa gat ttg etg etc ttg gat gte 1920 
Ser Gly Asp Lys Ser Glu Asn Val Gin Asp Leu Leu Leu Leu Asp Val 
625 630 635 640 

act ect ctt tec ctt ggt att gaa act get ^gt gga gte atg act gte 1968 
Thr Pro Leu Ser Leu Gly He Glu Thr Ala Gly Gly Val Met Thr Val 
645 650 655 

etc ate aag cgt aat acc ace att eet acc aag cag aea cag ace tte 2016 
Leu He Lys Arg Asn Thr Thr He Pro Thr Lys Gin Thr Gin Thr Phe 
660 665 670 

act acc tat tct gac aac cag ect ggt gtg ctt att cag gtt tat gaa 2064 
Thr Thr Tyr Ser Asp Asn Gin Pro Gly Val Leu He Gin Val Tyr Glu 
675 680 685 

ggc gag cgt gcc atg aea aag gat aac aac ctg ctt ggc aag ttt gaa 2112 
Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

etc aca ggc ata ect ect gca ccc cga ggt gtt cet cag att gaa gte 2160 
Leu Thr Gly He Pro Pro Ala Pro Arg Gly Val Pro Gin He Glu Val 
705 710 715 



720 



act ttt gac att gat gcc aat ggt ata etc aat gte tct get gtg gac 
Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Ser Ala Val Asp 



2208 
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725 730 735 

aag agt acg gga aaa gag aac aag att act ate act aat gac aag ggc 2256 
Lys Ser Thr Gly Lys Glu Asn Lys lie Thr lie Thr Asn Asp Lys Gly 
740 745 750 



cgt ttg age aag gaa gac att gaa cgt atg gtc cag gaa get gag aag 2304 
Arg Leu Ser Lys Glu Asp He Glu Arg Met Val Gin Glu Ala Glu Lys 
755 760 765 

tac aaa get gaa gat gag aag cag agg gac aag gtg tea tec aag aat 2352 
Tyr Lys Ala Glu Asp Glu Lys Gin Arg Asp Lys Val Ser Ser Lys Asn 
770 775 780 

tea ctt gag tee tat gee tte aae atg aaa gca act gtt gaa gat gag 2400 
Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Thr Val Glu Asp Glu 
785 790 795 800 

aaa ctt caa ggc aag att aac gat gag gac aaa cag aag att ctg gac 2448 
Lys Leu Gin Gly Lys He Asn Asp Glu Asp Lys Gin Lys He Leu Asp 
805 810 815 

aag tgt aat gaa att ate aac tgg ctt gat aag aat cag act get gag 2496 
Lys Cys Asn Glu He He Asn Trp Leu Asp Lys Asn Gin Thr Ala Glu 
620 825 830 

aag gaa gaa ttt gaa cat caa cag aaa gag ctg gag aaa gtt tgc aac 2544 
Lys Glu Glu Phe Glu His Gin Gin Lys Glu Leu Glu Lys Val Cys Asn 
835 840 845 

ecc ate ate ace aag ctg tac cag agt gca gga ggc atg cea gga gga 2592 
Pro He He Thr Lys Leu Tyr Gin Ser Ala Gly Gly Met Pro Gly Gly 
850 855 860 

atg ect ggg gga ttt ect ggt ggt gga get cet ecc tct ggt ggt get 2640 
Met Pro Gly Gly Phe Pro Gly Gly Gly Ala Pro Pro Ser Gly Gly Ala 
865 870 875 880 

tec tea ggg ecc ace att gaa gag gtt gat taa g 2674 
Ser Ser Gly Pro Thr He Glu Glu Val Asp 
885 890 



<210> 174 
<211> 890 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSC70 
<400> 174 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 



145 



wo 00/50872 



PCT/USOO/04794 



Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 BO 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Met Ser Lys Gly Pro Ala Val Gly He Asp Leu Gly 
245 250 255 

Thr Thr Tyr Ser Cys Val Gly Val Phe Gin His Gly Lys Val Glu He 
260 265 270 

He Ala Asn Asp Gin Gly Asn Arg Thr Thr Pro Ser Tyr Val Ala Phe 
275 280 285 

Thr Asp Thr Glu Arg Leu He Gly Asp Ala Ala Lys Asn Gin Val Ala 
290 295 300 

Met Asn Pro Thr Asn Thr Val Phe Asp Ala Lys Arg Leu He Gly Arg 
305 310 315 320 

Arg Phe Asp Asp Ala Val Val Gin Ser Asp Met Lys His Trp Pro Phe 
325 330 335 
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Met Val Val Asn Asp Ala Gly Arg Pro Lys Val Gin Val Glu Tyr Lys 
340 345 350 

Gly Glu Thr Lya Ser Phe Tyr Pro Glu Glu Val Ser Ser Met Val Leu 
355 360 365 

Thr Lys Met Lys Glu lie Ala Glu Ala Tyr Leu Gly Lys Thr Val Thr 
370 375 380 

Asn Ala Val Val Thr Val Pro Ala Tyr Phe Asn Asp Ser Gin Arg Gin 
385 390 395 400 

Ala Thr Lys Asp Ala Gly Thr lie Ala Gly Leu Asn Val Leu Arg lie 
405 410 415 

lie Asn Glu Pro Thr Ala Ala Ala lie Ala Tyr Gly Leu Asp Lys Lys 
420 425 430 

Val Gly Ala Glu Arg Asn Val Leu lie Phe Asp Leu Gly Gly Gly Thr 
435 440 445 

Phe Asp Val Ser lie Leu Thr lie Glu Asp Gly lie Phe Glu Val Lys 
450 455 460 

Ser Thr Ala Gly Asp Thr His Leu Gly Gly Glu Asp Phe Asp Asn Arg 
465 470 475 480 

Met Val Asn His Phe lie Ala Glu Phe Lys Arg Lys His Lys Lys Asp 
485 490 495 

lie Ser Glu Asn Lys Arg Ala Val Arg Arg Leu Arg Thr Ala Cys Glu 
500 505 510 

Arg Ala Lys Arg Thr Leu Ser Ser Ser Thr Gin Ala Ser He Glu He 
515 520 525 

Asp Ser Leu Tyr Glu Gly He Asp Phe Tyr Thr Ser He Thr Arg Ala 
530 535 540 

Arg Phe Glu Glu Leu Asn Ala Asp Leu Phe Arg Gly Thr Leu Asp Pro 
545 550 555 560 

Val Glu Lys Ala Leu Arg Asp Ala Lys Leu Asp Lys Ser Gin He His 
565 570 575 

Asp He Val Leu Val Gly Gly Ser Thr Arg He Pro Lys He Gin Lys 
580 585 590 

Leu Leu Gin Asp Phe Phe Asn Gly Lys Glu Leu Asn Lys Ser He Asn 
595 600 605 

Pro Asp Glu Ala Val Ala Tyr Gly Ala Ala Val Gin Ala Ala He Leu 
610 615 620 

Ser Gly Asp Lys Ser Glu Asn Val Gin Asp Leu Leu Leu Leu Asp Val 
625 630 635 640 



147 



wo 00/50872 



PCT/USOO/04794 



Thr Pro Leu Ser Leu Gly He Glu Thr Ala Gly Gly Val Met Thr Val 
645 650 655 

Leu He Lys Arg Asn Thr Thr He Pro Thr Lys Gin Thr Gin Thr Phe 
660 665 670 

Thr Thr Tyr Ser Asp Asn Gin Pro Gly Val Leu He Gin Val Tyr Glu 
675 680 685 

Gly Glu Arg Ala Met Thr Lys Asp Asn Asn Leu Leu Gly Lys Phe Glu 
690 695 700 

Leu Thr Gly He Pro Pro Ala Pro Arg Gly Val Pro Gin He Glu Val 
705 710 715 720 

Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Ser Ala Val Asp 
725 730 735 

Lys Ser Thr Gly Lys Glu Asn Lys He Thr He Thr Asn Asp Lys Gly 
740 745 750 

Arg Leu Ser Lys Glu Asp He Glu Arg Met Val Gin Glu Ala Glu Lys 
755 760 765 

Tyr Lys Ala Glu Asp Glu Lys Gin Arg Asp Lys Val Ser Ser Lys Asn 
770 775 780 

Ser Leu Glu Ser Tyr Ala Phe Asn Met Lys Ala Thr Val Glu Asp Glu 
785 790 795 800 

Lys Leu Gin Gly Lys He Asn Asp Glu Asp Lys Gin Lys He Leu Asp 
805 810 815 

Lys Cys Asn Glu He He Asn Tzp Leu Asp Lys Asn Gin Thr Ala Glu 
820 825 830 

Lys Glu Glu Phe Glu His Gin Gin Lys Glu Leu Glu Lys Val Cys Asn 
835 840 845 

Pro He He Thr Lys Leu Tyr Gin Ser Ala Gly Gly Met Pro Gly Gly 
850 855 860 

Met Pro Gly Gly Phe Pro Gly Gly Gly Ala Pro Pro Ser Gly Gly Ala 
865 870 875 880 

Ser Ser Gly Pro Thr He Glu Glu Val Asp 
885 890 



<210> 175 
<211> 2458 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-HSFl 
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<220> 

<221> CDS 

<222> (1) . . (2349) 



<400> 175 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate eta 
Met Val Ser Lys Gly Glu Glu Leu Phe Thr gI? vlf HI ill Jeu 
^ 10 15 

n^^ f ''^ f^*" 3'=* 99'' ttc age gtg tee ggc 96 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser gII 

2° 25 30 

Glu m V «u r?! t^"" aag ctg ace etg aag ttc ate 144 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg ace acc 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Ja! ?S 



55 



60 



95 



cgc acc ate ttc ttc aag gac gac ggc aac tae aag acc egc gee qaq 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr A^g Sa Glu 
100 105 



110 



v!? It'' 99C gac acc ctg gtg aac cge ate gag etg aag ggc 
val Lys Phe Glu Gly Asp Thr Leu Val Asa Arg lie Glu LeS Lyf Sy 
115 120 ' 



125 



r^l t It'' 9ac ggc aac ate ctg ggg cae aag etg gag tae 

He Asp Phe Lys Glu Asp Gly Asa He Leu Gly His Lys Leu 111 T^ 

135 140 



''f'' 9tc tat ate atg gee gac aag cag aag aac 
Asn Tyr Asa Ser His Asn Val Tyr He Met Ala Isp Lys Gin Lys til 
150 155 

gge ate aag gtg aac ttc aag ate egc cae aac ate gag gac gqc aae 
Gly He Lys Val Asa Phe Lys He Arg His Asa He Glu Cp Sy sS 



192 



240 



etg acc tae gge gtg cag tgc ttc age cgc tae ccc gac cae atq aaq 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Lp His mt lyl 
" 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa gge tae gte eaq qaa 2ft« 
Gla Hxs Asp Phe Phe Lys Ser Ala Met Pro Glu Ity Tyr vS G^a Glu 



165 



175 



336 



364 



432 



460 



528 



576 



Va? r^n T r^"" "'^'^ ^^"^ "9 gac ggc 

val Gin Leu Ala Asp His Tyr Gin Gin Aen Thr Pro He Gly Lp lly 

185 

Pro rlf. f""^ ^^"^ ''^^ "^^^ "9 tec gcc ctg 624 

Pro Val Leu Leu Pro Aap Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 



195 200 



205 
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age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Ilu Phe 
210 215 220 

gtg acc gcc gcc ggg ate act cte gge atg gac gag ctg tac aag tec 720 
yal Tbr Ala Ala Gly He Thr Leu Gly Met Asp 5lu Leu Tyr Ly? Ser 
230 235 240 

gga cte aga tet cga get eaa get teg aat tct gea gte gag atg gat 76B 
Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Val Glu Me? Lp 
245 250 255 

Ctg ccc gtg ggc ccc gge gcg gcg ggg ccc age aac gtc ccg gcc ttc 816 
Leu Pro val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ma Phe 
260 265 270 

il» ^^l f*^ f ^ ""^^ ^""^ ^^"^ 9*'' ^==9 a*'' ^'^'^ sac gcg etc 864 

Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu 
275 280 285 

ate tgc tgg age ccg age ggg aac age ttc cac gtg ttc gac cag ggc 912 
He Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gin Glv 
290 295 300 

11^ ^^'^ ^•'^ ""^^ ttc aag cac aac aac atg 960 

Gin Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met 
310 

?r c^*' 11"" ^^'^ '^^'^ SS'^ ttc egg aaa gtg gtc 1008 

Ala ser Phe Val Arg Gin Leu Asn Met Tyr Gly Phe Arg Lys Val Val 
325 330 335 

«f! T^*" Hf^ "3 3?*" 33*= ''^^ 9*='= '^''^ 9^9 9ac gac aeg gag 1056 
Has He Glu Gin Gly Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu 
340 345 

lit r?^ u*'' 'u*" "^"^ ''^t ggc cag gag cag etc ctt gag aac 1104 

Phe Gin His Pro Cys Phe Leu Arg Gly Gin Glu Gin Leu Leu Glu Asn 
355 360 365 

ate aag agg aaa gtg acc agt gtg tec acc ctg aag agt gaa gac ata 1152 
He Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp He 
370 375 380 

aag ate cgc cag gac age gtc ace aag ctg ctg acg gac gtg cag ctg 1200 
Lys He Arg Gin Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gin Leu 
390 395 

m2 jtl f*^ '^'c atg gac tec aag etc ctg gcc atg aag 1248 

Met Lys Gly Lys Gin Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys 
*05 410 415 

cat gag aat gag get ctg tgg egg gag gtg gee age ctt egg cag aag 1296 
His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu aS Gin LyJ 
420 425 430 
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cat gcc cag caa cag aaa gtc gtc aac aag etc att cag ttc ctg ate 1344 
His Ala Gin Gin Gin Lye Val Val Asn Lys Leu He Gin Phe Leu He 
435 440 445 

tea ctg gtg cag tea aae egg ate ctg ggg gtg aag aga aag ate ecc 1392 
Ser Leu Val Gin Ser Asn Arg He Leu Gly Val Lys Arg Lys He Pro 
450 455 460 

ctg atg ctg aac gae agt gge tea gca eat tee atg cec aag tat age 1440 
Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser 
470 475 480 

egg cag ttc tec ctg gag cac gtc cae gge teg gge cec tae teg gee 1488 
Arg Gin Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala 
485 490 495 

cec tec eca gcc tae age age tec age etc tac gee cet gat get gtg 1536 
Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val 
500 505 510 

gcc age tct gga ecc ate ate tec gac ate ace gag ctg get cet gee 1584 
Ala Ser Ser Gly Pro He He Ser Asp He Thr Glu Leu Ala Pro Ala 
515 520 525 

age ecc atg gcc tec ecc gge ggg age ata gac gag agg cec eta tec 1632 
Ser Pro Met Ala Ser Pro Gly Gly Ser He Asp Glu Arg Pro Leu Ser 
530 535 540 

age age ecc ctg gtg cgt gtc aag gag gag ecc cec age ceg cet cag 1680 
Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gin 
^45 550 555 560 

age cec egg gta gag gag gcg agt ecc ggg cgc eca tct tee gtg gac 1728 
Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp 
565 570 575 

ace etc ttg tec ccg ace gee etc att gae tee ate ctg egg gag agt 1776 
Thr Leu Leu Ser Pro Thr Ala Leu He Asp Ser He Leu Arg Glu Ser 
580 585 590 

gaa ect gcc cec gcc tec gtc aea gee etc aeg gac gee agg gge cac 1824 
Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His 
595 600 $05 

aeg gae aec gag gge egg ect cec tec ecc ccg cec ace tee ace cet 1872 
Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro 
610 615 620 

gaa aag tge etc age gta gcc tgc ctg gae aag aat gag etc agt gac 1920 
Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp 
625 630 635 640 

cac ttg gat get atg gae tee aae ctg gat aae ctg cag aec atg ctg 1968 
His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gin Thr Met Leu 
645 650 655 

age age cac gge ttc age gtg gac ace agt gee ctg ctg gac ctg ttc 2016 
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Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe 
660 665 670 

age ccc teg gtg aee gtg cce gac atg age ctg cct gac ctt gac age 2064 
Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser 
675 680 685 

age ctg gcc agt ate caa gag etc ctg tet ecc cag gag ccc ccc agg 2112 
Ser Leu Ala Ser lie Gin Glu Leu Leu Ser Pro Gin Glu Pro Pro Arg 
690 695 700 

cct ecc gag gca gag aac age age ccg gat tea ggg aag cag ctg gtg 2160 
Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gin Leu Val 
705 710 715 720 

cac tae aca gcg cag ccg ctg tte ctg ctg gac ccc ggc tec gtg gac 2208 
His Tyr Thr Ala Gin Pro Leu Phe Leu Leu Asp Pro Gly Ser Val Asp 
725 730 735 

acc ggg age aac gac ctg ccg gtg ctg ttt gag ctg gga gag ggc tee 2256 
Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser 
740 745 750 

tac tte tec gaa ggg gac ggc tte gcc gag gac ecc acc ate tec ctg 2304 
Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr He Ser Leu 
755 760 765 

ctg aca ggc teg gag cct ccc aaa gcc aag gac ccc act gtc tec 2349 
Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser 
770 775 780 

tagaggccce ggaggagctg ggccagccgc ccaeccccac ccceagtgca gggctggtct 2409 

tggggaggca gggeagcetc geggtcttgg geactggtgg gteggecgg 2458 



<210> 176 
<211> 783 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPP-HSFl 
<400> 176 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 
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Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
^5 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly Leu Arg Ser Arg Ala Gin Ala Ser Asn Ser Ala Val Glu Met Asp 
245 250 255 

Leu Pro Val Gly Pro Gly Ala Ala Gly Pro Ser Asn Val Pro Ala Phe 
260 265 270 

Leu Thr Lys Leu Trp Thr Leu Val Ser Asp Pro Asp Thr Asp Ala Leu 
275 280 285 

He Cys Trp Ser Pro Ser Gly Asn Ser Phe His Val Phe Asp Gin Gly 
290 295 300 

Gin Phe Ala Lys Glu Val Leu Pro Lys Tyr Phe Lys His Asn Asn Met 
305 310 315 320 

Ala Ser Phe Val Arg Gin Leu Asn Met Tyr Gly Phe Arg Lys Val Val 
325 330 335 

His He Glu Gin Gly. Gly Leu Val Lys Pro Glu Arg Asp Asp Thr Glu 
340 345 350 



Phe Gin His Pro Cys Phe Leu Arg Gly Gin Glu Gin Leu Leu Glu Asn 
355 360 365 
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He Lys Arg Lys Val Thr Ser Val Ser Thr Leu Lys Ser Glu Asp He 
370 375 380 

Lys He Arg Gin Asp Ser Val Thr Lys Leu Leu Thr Asp Val Gin Leu 
385 390 395 400 

Met Lys Gly Lys Gin Glu Cys Met Asp Ser Lys Leu Leu Ala Met Lys 
405 410 415 

His Glu Asn Glu Ala Leu Trp Arg Glu Val Ala Ser Leu Arg Gin Lys 
420 425 430 

His Ala Gin Gin Gin Lys Val Val Asn Lys Leu He Gin Phe Leu He 
435 440 445 

Ser Leu Val Gin Ser Asn Arg He Leu Gly Val Lys Arg Lys He Pro 
450 455 460 

Leu Met Leu Asn Asp Ser Gly Ser Ala His Ser Met Pro Lys Tyr Ser 
465 470 475 480 

Arg Gin Phe Ser Leu Glu His Val His Gly Ser Gly Pro Tyr Ser Ala 
485 490 495 

Pro Ser Pro Ala Tyr Ser Ser Ser Ser Leu Tyr Ala Pro Asp Ala Val 
500 505 510 

Ala Ser Ser Gly Pro He He Ser Asp He Thr Glu Leu Ala Pro Ala 
515 520 525 

Ser Pro Met Ala Ser Pro Gly Gly Ser He Asp Glu Arg Pro Leu Ser 
530 535 540 

Ser Ser Pro Leu Val Arg Val Lys Glu Glu Pro Pro Ser Pro Pro Gin 
545 550 555 560 

Ser Pro Arg Val Glu Glu Ala Ser Pro Gly Arg Pro Ser Ser Val Asp 
565 570 575 

Thr Leu Leu Ser Pro Thr Ala Leu He Asp Ser He Leu Arg Glu Ser 
580 585 590 

Glu Pro Ala Pro Ala Ser Val Thr Ala Leu Thr Asp Ala Arg Gly His 
595 600 605 

Thr Asp Thr Glu Gly Arg Pro Pro Ser Pro Pro Pro Thr Ser Thr Pro 
610 615 620 

Glu Lys Cys Leu Ser Val Ala Cys Leu Asp Lys Asn Glu Leu Ser Asp 
625 630 635 640 

His Leu Asp Ala Met Asp Ser Asn Leu Asp Asn Leu Gin Thr Met Leu 
645 650 655 

Ser Ser His Gly Phe Ser Val Asp Thr Ser Ala Leu Leu Asp Leu Phe 
660 665 670 
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Ser Pro Ser Val Thr Val Pro Asp Met Ser Leu Pro Asp Leu Asp Ser 
675 680 685 

Ser Leu Ala Ser He Gin Glu Leu Leu Ser Pro Gin Glu Pro Pro Arg 

695 700 



Pro Pro Glu Ala Glu Asn Ser Ser Pro Asp Ser Gly Lys Gin Leu Val 
710 715 720 

His Tyr Thr Ala Gin Pro Leu Phe Leu Leu Asp Pro Gly Ser Val Asp 
725 730 735 

Thr Gly Ser Asn Asp Leu Pro Val Leu Phe Glu Leu Gly Glu Gly Ser 
740 745 750 

Tyr Phe Ser Glu Gly Asp Gly Phe Ala Glu Asp Pro Thr He Ser Leu 
755 760 765 

Leu Thr Gly Ser Glu Pro Pro Lys Ala Lys Asp Pro Thr Val Ser 
770 775 780 



<210> 177 
<211> 2416 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-NPKB 

<220> 

<221> CDS 

<222> (1) . , (2415) 



<400> 177 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 48 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
^5 10 15 

gtc gag ctg gac ggc gae gta aac ggc cac aag ttc age gtg tee ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gee acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 



35 40 



45 



tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc acc etc gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 



Ctg acc tac ggc gtg cag tgc ttc age cgc tac ccc gae cac atg aag 
Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 



75 80 



cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 



240 



288 
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85 90 9S 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gcc gag 336 
Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lye Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gcc gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gae ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc cee ate ggc gae ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

cee gtg ctg ctg ccc gae aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa.gac ecc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 



gtg acc gee gcc ggg ate act etc ggc atg gac gag ctg tac aag tec 
Val Thr JUa Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 



720 



gga etc aga tet ega gat ceg ccc ttc atg gac gaa ctg ttc ccc etc 768 
Gly Leu Arg Ser Arg Asp Pro Pro Phe Met Asp Glu Leu Phe Pro Leu 
245 250 255 

ate ttc ceg gea gag cca gcc cag gee tet ggc ecc tat gtg gag ate 816 
He Phe Pro Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu He 
260 265 270 

att gag cag ccc aag cag egg ggc atg cgc ttc cgc tac aag tge gag 864 
He Glu Gin Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu 
275 280 285 

ggg cgc tec gcg ggc age ate cca ggc gag agg age aca gat acc ace 912 
Gly Arg Ser Ala Gly Ser He Pro Gly Glu Arg Ser Thr Asp Thr Thr 
290 295 300 

aag acc cac ccc ace ate aag ate aat ggc tac aca gga cca ggg aca 960 
Lys Thr His Pro Thr He Lys He Asn Gly Tyr Thr Gly Pro Gly Thr 
305 310 315 320 
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gtg cgc ate tec ctg gtc acc aag gac cct cct cac egg cat cac ccc X008 
Val Arg lie Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro 
325 330 335 

cac gag ctt gta gga aag gac tgc egg gat ggc ttc tat gag get gag 1056 
His Glu Leu Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu 
340 345 350 

etc tgc ccg gac cgc tgc ate cac agt ttc cag aac ctg gga ate cag 1104 
Leu Cys Pro Asp Arg Cys He His Ser Phe Gin Asn Leu Gly He Gin 
355 360 365 

tgt gtg aag aag egg gac ctg gag cag get ate agt cag cgc ate cag 1152 
Cys Val Lys Lys Arg Asp Leu Glu Gin Ala He Ser Gin Arg He Gin 
370 375 380 

acc aac aac aac ccc ttc caa gtt cct ata gaa gag cag cgt ggg gac 1200 
Thr Asn Asn Asn Pro Phe Gin Val Pro He Glu Glu Gin Arg Gly Asp 
385 390 395 400 

tac gac ctg aat get gtg egg etc tgc ttc cag gtg aca gtg egg gac 1248 
Tyr Asp Leu Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp 
405 410 415 

cca tea ggc agg ccc etc cgc ctg ccg cct gtc ctt tet cat ccc ate 1296 
Pro Ser Gly Arg Pro Leu Arg Leu Pro Pro Val Leu Ser His Pro He 
420 425 430 

ttt gac aat cgt gee ccc aac act gee gag etc aag ate tgc cga gtg 1344 
Phe Asp Asn Arg Ala Pro Asn Thr Ala Glu Leu Lys He Cys Arg Val 
435 440 445 

aac cga aac tet ggc age tgc etc ggt ggg gat gag ate ttc eta ctg 1392 
Asn Arg Asn Ser Gly Ser Cys Leu Gly Gly Asp Glu He Phe Leu Leu 
450 455 460 

tgt gac aag gtg cag aaa gag gac att gag gtg tat ttc acg gga cca 1440 
Cys Asp Lys Val Gin Lys Glu Asp He Glu Val Tyr Phe Thr Gly Pro 
465 470 475 480 

ggc tgg gag gee cga ggc tec ttt teg caa get gat gtg cac cga caa 1488 
Gly Trp Glu Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin 
485 490 495 

gtg gee att gtg ttc egg acc cct ccc tac gca gac ccc age ctg cag 1536 
Val Ala He Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin 
500 505 510 

get cct gtg cgt gtc tec atg cag ctg egg egg cct tec gac egg gag 1584 
Ala Pro Val Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu 
515 520 525 

etc agt gag ccc atg gaa ttc cag tac ctg cca gat aca gac gat cgt 1632 
Leu Ser Glu Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg 
530 535 540 
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cac egg att gag gag aaa cgt aaa agg aca tat gag acc ttc aag age 1680 
HiB Arg He Glu Glu Lys Arg Lye Arg Thr Tyr Glu Thr Phe Lys Ser 

550 555 560 

ate atg aag aag agt cct ttc age gga ccc acc gac ccc egg ect cca 1728 
He Met Lys Lys Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro 
565 570 575 

cct cga cge att get gtg cct tec egc age tea get tct gtc ccc aag 1776 
Pro Arg Arg He Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys 
580 585 590 

cca gca ccc cag ccc tat ccc ttt aeg tea tec ctg age acc ate aac 1824 
Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr He Asn 
595 600 605 

tat gat gag ttt ccc acc atg gtg ttt cct tct ggg cag ate age cag 1872 
Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin 
€10 615 620 



gee teg gee ttg gee ccg gee cct ccc caa gtc ctg ccc. cag get cca 
Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro 
625 630 635 



640 



1920 



gee cct gcc cct get cca gee atg gta tea get ctg gee cag gee cca 1968 
Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro 
645 650 655 

gcc cct gtc cca gtc eta gcc cca gge cct cct cag get gtg gee cca 2016 
Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro 
660 665 670 

cct gcc ccc aag ccc acc cag get ggg gaa gga aeg ctg tea gag gcc 2064 
Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala 
675 680 685 

ctg ctg cag ctg cag ttt gat gat gaa gac ctg ggg gcc ttg ctt gge 2112 
Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly 
690 695 700 

aac age aca gac cca get gtg ttc aca gac ctg gca tec gtc gac aac 2160 
Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn 
705 710 715 720 

tec gag ttt cag cag ctg ctg aac cag gge ata cct gtg gcc ccc cac 2208 
Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His 
725 730 735 

aca act gag ccc atg ctg atg gag tac cct gag get ata act egc eta 2256 
Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu 
740 745 750 

gtg aca gcc cag agg ccc ccc gac cca get ect get cca ctg ggg gcc 2304 
Val Thr Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala 
755 760 765 

ccg ggg etc ccc aat gge etc ctt tea gga gat gaa gac ttc tec tec 2352 
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Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser 
770 

att gcg gac atg gac ttc tea gee ctg ctg agt cag ate age tec aag 2400 
lie Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser Lvs 
790 795 800 

ggc gaa ttc gaa get t 
Gly Glu Phe Glu Ala 
805 



<210> 178 
<211> 805 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GPP-NFKB 
<400> 178 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 
15 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
^5 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly He Lys Val Asn Phe Lys He firs His Asn He Glu Asp Gly Ser 
165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 
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Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys Ser 
230 235 240 

Gly Leu Arg Ser Arg Asp Pro Pro Phe Met Asp Glu Leu Phe Pro Leu 
245 250 255 

He Phe Pro Ala Glu Pro Ala Gin Ala Ser Gly Pro Tyr Val Glu He 
260 265 270 

He Glu Gin Pro Lys Gin Arg Gly Met Arg Phe Arg Tyr Lys Cys Glu 
275 280 285 

Gly Arg Ser Ala Gly Ser He Pro Gly Glu Arg Ser Thr Asp Thr Thr 
290 295 300 



Lys Thr His Pro Thr He Lys He Asn Gly Tyr Thr Gly Pro Glv Thr 
310 315 320 

Val Arg He Ser Leu Val Thr Lys Asp Pro Pro His Arg Pro His Pro 
325 330 335 

His Glu Leu Val Gly Lys Asp Cys Arg Asp Gly Phe Tyr Glu Ala Glu 
340 345 350 

Leu Cys Pro Asp Arg Cys He His Ser Phe Gin Asn Leu Gly He Gin 
355 360 365 

Cys Val Lys Lys Arg Asp Leu Glu Gin Ala He Ser Gin Arg He Gin 
370 375 380 

Thr Asn Asn Asn Pro Phe Gin Val Pro He Glu Glu Gin Arg Gly Asp 
390 395 

Tyr Asp Leu Asn Ala Val Arg Leu Cys Phe Gin Val Thr Val Arg Asp 
405 410 415 

Pro Ser Gly Arg Pro Leu Arg Leu Pro Pro Val Leu Ser His Pro He 
420 425 430 

Phe Asp Asn Arg Ala Pro Asn Thr Ala Glu Leu Lys He Cys Arg Val 
435 440 445 

Asn Arg Asn Ser Gly Ser Cys Leu Gly Gly Asp Glu He Phe Leu Leu 
450 455 460 

Cys Asp Lys Val Gin Lys Glu Asp He Glu Val Tyr Phe Thr Gly Pro 
465 470 



475 



480 



Gly Trp Glu Ala Arg Gly Ser Phe Ser Gin Ala Asp Val His Arg Gin 
485 490 495 
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Val Ala lie Val Phe Arg Thr Pro Pro Tyr Ala Asp Pro Ser Leu Gin 
500 505 510 

Ala Pro Val Arg Val Ser Met Gin Leu Arg Arg Pro Ser Asp Arg Glu 
515 520 525 

Leu Ser Glu Pro Met Glu Phe Gin Tyr Leu Pro Asp Thr Asp Asp Arg 
530 535 540 

His Arg lie Glu Glu Lys Arg Lys Arg Thr Tyr Glu Thr Phe Lys Ser 
545 550 555 560 

He Met Lys Lya Ser Pro Phe Ser Gly Pro Thr Asp Pro Arg Pro Pro 
565 570 575 

Pro Arg Arg He Ala Val Pro Ser Arg Ser Ser Ala Ser Val Pro Lys 
580 585 590 

Pro Ala Pro Gin Pro Tyr Pro Phe Thr Ser Ser Leu Ser Thr He Asn 
595 600 605 

Tyr Asp Glu Phe Pro Thr Met Val Phe Pro Ser Gly Gin He Ser Gin 
610 615 620 

Ala Ser Ala Leu Ala Pro Ala Pro Pro Gin Val Leu Pro Gin Ala Pro 
625 630 635 640 

Ala Pro Ala Pro Ala Pro Ala Met Val Ser Ala Leu Ala Gin Ala Pro 
€45 650 655 

Ala Pro Val Pro Val Leu Ala Pro Gly Pro Pro Gin Ala Val Ala Pro 
660 665 670 

Pro Ala Pro Lys Pro Thr Gin Ala Gly Glu Gly Thr Leu Ser Glu Ala 
675 680 685 

Leu Leu Gin Leu Gin Phe Asp Asp Glu Asp Leu Gly Ala Leu Leu Gly 
690 695 700 

Asn Ser Thr Asp Pro Ala Val Phe Thr Asp Leu Ala Ser Val Asp Asn 
''OS 710 715 720 

Ser Glu Phe Gin Gin Leu Leu Asn Gin Gly He Pro Val Ala Pro His 
725 730 735 

Thr Thr Glu Pro Met Leu Met Glu Tyr Pro Glu Ala He Thr Arg Leu 
740 745 750 

Val Thr Ala Gin Arg Pro Pro Asp Pro Ala Pro Ala Pro Leu Gly Ala 
755 760 765 

Pro Gly Leu Pro Asn Gly Leu Leu Ser Gly Asp Glu Asp Phe Ser Ser 
770 775 780 

He Ala Asp Met Asp Phe Ser Ala Leu Leu Ser Gin He Ser Ser Lys 
"785 790 795 800 



161 



wo 00/50872 



PCT/USOO/04794 



Gly Glu Phe Glu Ala 
605 



<210> 179 
<211> 1677 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 

<220> 

<221> CDS 

<222> (1) ..(1674) 

<400> 179 

atg ttc cag gcg get gag cgc ccc cag gag tgg gcc atg gag ggc ccc 48 
Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 
15 10 15 

cgc gac ggg ctg aag aag gag egg eta ctg gac gac cgc cac gac age 96 
Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Aep Asp Arg His Asp Ser 
20 25 30 

ggc ctg gac tec atg aaa gac gag gag tac gag cag atg gtc aag gag 144 
Gly Leu Asp Ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 
35 40 45 

ctg cag gag ate cgc etc gag ceg cag gag gtg ccg cgc ggc teg gag 192 
Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 
50 55 60 

CCC tgg aag cag cag etc ace gag gac ggg gac teg ttc ctg cae ttg 240 
Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 
65 70 75 80 

gee ate ate cat gaa gaa aag gca ctg ace atg gaa gtg ate cgc cag 288 
Ala lie lie His Glu Glu Lys Ala Leu Thr Met Glu Val lie Arg Gin 
85 90 95 

gtg aag gga gac ctg gcc ttc etc aac etc cag aac aac ctg cag cag 336 
Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gin Asn Asn Leu Gin Gin 
100 105 110 

act cca etc cac ttg get gtg ate ace aac cag cea gaa att get gag 384 
Thr Pro Leu His Leu Ala Val lie Thr Asn Gin Pro Glu He Ala Glu 
115 120 125 

gca ctt ctg gga get ggc tgt gat cct gag etc ega gac ttt cga gga 432 
Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Gly 
130 135 140 

aat acc ccc eta cac ctt gee tgt gag cag ggc tge ctg gcc age gtg 480 
Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
145 150 155 160 
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gga gtc ctg act cag tec tgc acc acc ccg cac etc cac tee ate ttg 528 
Gly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser He Leu 
165 170 

aag get acc aac tac aat ggc cac acg tgt eta cac tta gee tct ate 576 
Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser He 
180 185 190 

r^^ ^^"^ ""^^ 9tg tec ttg ggt get gat 624 

His Gly Tyr Leu Gly He Val Glu Leu Leu Val Ser Leu Gly Ala Asp 

200 205 

gtc aat get cag gag ecc tgt aat ggc egg act gee ett cac etc gea 672 
Val Asn Ala Gin Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala 
210 215 220 

gtg gac ctg caa aat cct gae ctg gtg tea etc ctg ttg aag tgt ggg 720 
Val Asp Leu Gin Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly 
230 235 240 

get gat gtc aac aga gtt acc tac cag ggc tat tct ecc tac cag etc 768 
Ala Asp Val Asn Arg Val Thr Tyr Gin Gly Tyr Ser Pro Tyr Gin Leu 
245 250 255 

acc tgg ggc cgc cca age acc egg ata cag cag cag ctg ggc cag ctg 816 
Thr Trp Gly Arg Pro Ser Thr Arg lie Gin Gin Gin Leu Gly Gin Leu 
260 265 270 

aca eta gaa aac ett cag atg ctg cca gag agt gag gat gag gag age 864 
Thr Leu Glu Asn Leu Gin Met Leu Pro Glu Ser Glu Asp Glu Glu Ser 
275 280 285 

tat gae aca gag tea gag tte acg gag tte aca gag gae gag ctg ecc 912 
Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro 
290 295 300 

tat gat gae tgt gtg ttt gga ggc cag cgt ctg acg tta ace ggt atg 960 
Tyr Asp Asp Cys Val Phe Gly Gly Gin Arg Leu Thr Leu Thr Gly Met 
310 315 320 

get age aaa gga gaa gaa etc tte act gga gtt gtc cca att ett gtt 1008 
Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
325 330 335 

gaa tta gat ggt gat gtt aac ggc cac aag tte tct gtc agt gga gag 1056 
Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
340 345 350 

ggt gaa ggt gat gea aca tac gga aaa ett ace ctg aag tte ate tgc 1104 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
355 360 365 

act act ggc aaa ctg cct gtt cca tgg cca aca eta gtc act act ctg 1152 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

380 



370 375 
tgc tat ggt gtt caa tgc ttt tea aga tac ccg gat eat atg aaa egg 



1200 
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Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 
390 395 400 

11^ ^^'^ ^""'^ '''''' 9*^^ cag gaa agg 1248 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Qlu Arg 
405 410 415 

acc ate ttc ttc aaa gat gac gge aac tac aag aca cgt get gaa gtc 1296 
Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
420 425 430 

aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 1344 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
435 440 445 

gac ttc aag gaa gat ggc aac att ctg gga cac aaa ttg gaa tac aac 1392 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
450 455 460 

tat aac tea cac aat gta tac ate atg gca gac aaa caa aag aat gga 1440 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Glv 
470 475 480 

ate aaa gtg aac ttc aag ace cge cac aac att gaa gat gga age gtt 1488 
He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser Val 
485 490 495 

caa eta gca gac cat tat caa caa aat act cca att ggc gat ggc ect 1536 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
500 505 510 

gtc ctt tta cca gac aac cat tac ctg tec aca caa tct gcc ctt teg 1584 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
515 520 525 

aaa gat cec aac gaa aag aga gac cac atg gtc ctt ctt gag ttt gta 1632 
Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
530 535 540 

aca get get ggg att aca eat gge atg gat gaa ctg tac aac tag 1677 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
545 550 555 

<210> 180 
<211> 558 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: GFP-IKB 
<400> 180 

Met Phe Gin Ala Ala Glu Arg Pro Gin Glu Trp Ala Met Glu Gly Pro 
1 5 10 15 

Arg Asp Gly Leu Lys Lys Glu Arg Leu Leu Asp Asp Arg His Asp Ser 
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20 25 30 

Gly Leu Asp ser Met Lys Asp Glu Glu Tyr Glu Gin Met Val Lys Glu 

35 ' * 



40 45 
Leu Gin Glu lie Arg Leu Glu Pro Gin Glu Val Pro Arg Gly Ser Glu 



55 60 
Pro Trp Lys Gin Gin Leu Thr Glu Asp Gly Asp Ser Phe Leu His Leu 



70 



75 



SO 



Ala He He His Glu Glu Lys Ala Leu Thr Met Glu Val He Arq Gin 
85 90 



95 

Val Lys Gly Asp Leu Ala Phe Leu Asn Leu Gin Asn Asn Leu Gin Gin 
100 105 

Thr Pro Leu His Leu Ala Val He Thr Asn Gin Pro Glu He Ala Glu 
115 120 125 

Ala Leu Leu Gly Ala Gly Cys Asp Pro Glu Leu Arg Asp Phe Arg Glv 
130 135 ' 

Asn Thr Pro Leu His Leu Ala Cys Glu Gin Gly Cys Leu Ala Ser Val 
" ISO 155 160 

Gly Val Leu Thr Gin Ser Cys Thr Thr Pro His Leu His Ser He Leu 
165 170 175 

Lys Ala Thr Asn Tyr Asn Gly His Thr Cys Leu His Leu Ala Ser He 
100 185 190 

His Gly Tyr Leu Gly He Val Glu Leu Leu Val Ser Leu Gly Ala Asp 
195 200 205 

val Asn Ala Gin Glu Pro Cys Asn Gly Arg Thr Ala Leu His Leu Ala 
210 215 220 



Val Asp Leu Gin Asn Pro Asp Leu Val Ser Leu Leu Leu Lys Cys Gly 
230 235 240 

Ala Asp Val Asn Arg Val Thr Tyr Gin Gly Tyr Ser Pro Tyr Gin Leu 
245 250 255 

Thr Trp Gly Arg Pro Ser Thr Arg He Gin Gin Gin Leu Gly Gin Leu 



260 



265 



270 



Thr Leu Glu Asn Leu Gin Met Leu Pro Glu Ser Glu Asp Glu Glu Ser 
275 280 



285 



Tyr Asp Thr Glu Ser Glu Phe Thr Glu Phe Thr Glu Asp Glu Leu Pro 
290 295 300 

Tyr Asp Asp cys Val Phe Gly Gly Gin Arg Leu Thr Leu Thr Gly Met 



310 



315 320 
Ala Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 
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325 



330 



335 



Glu Leu Aep Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu 
340 345 350 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
355 360 365 

Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
370 375 380 

Cys Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Arg 

390 395 400 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
405 410 415 

Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
420 425 430 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
435 440 445 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr Asn 
450 455 460 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn Gly 
465 470 475 480 

He Lys Val Asn Phe Lys Thr Arg His Asn He Glu Asp Gly Ser Val 
485 490 495 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
500 505 510 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser .Thr Gin Ser Ala Leu Ser 
515 520 525 



Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val 
530 535 540 

Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Asn 
545 550 555 
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